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This  study  examines  the  effect  of  two  format  variables, 
side  of  topic  and  speaker  position,  and  three  rater 
variables,  sex,  prestige,  and  proximity,  on  the  outcome 
of  intercollegiate  debates.   The  data  were  gathered  from 
three  different  debate  tournaments  covering  a  five  year 
period.   In  all,  the  data  pool  consisted  of  two  thousand, 
five  hundred  and  seventy  different  debates.   Using  Menden- 
hall's  procedure  for  achieving  limits  on  the  errors  of  esti- 
mation, samples  were  drawn  from  the  data  pool  and  submitted 
to  analysis  by  the  University  of  Florida  Computing  Center, 
using  the  BMDX63  "Multivariate  Linear  Hypothesis"  program. 
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The  results  of  the  regression  analysis  indicate 
that  the  format  variables  had  no  significant  effect  on 
the  outcome  of  debates  studied. 

However,  in  the  case  of  the  rater  variables   studied, 
significant  regression  effects  were  found.   In  general, 
male  debaters  had  a  higher  expectation  of  winning  any 
given  debate,  and  received  higher  speaker  ratings  than 
did  female  debaters.   Interaction  effects  between  the 
sex  of  the  debater,  the  sex  of  the  debater's  colleague, 
and  the  sex  of  the  judge  were  observed. 

Studies  of  the  rater  variable  prestige  indicated 
that  high  prestige  teams  receive  higher  scores  on  speaker 
rating,  team  rating,  and  speaker  ranking  than  do  either 
low  prestige  or  junior  college  teams.   Again,  interaction 
effects  between  the  prestige  of  the  debaters,  the 
prestige  of  their  opponents,  and  the  prestige  of  the 
judge  were  observed. 

Studies  of  the  rater  variable  proximity  revealed 
that  teams  farthest  from  the  judge  geographically  have  a 
higher  expectation  of  winning  and  receive  higher  scores 
than  their  opponents. 

The  results  of  this  study  allowed  the  development  of 
a  number  of  prediction  equations  whereby  outcome  on  any  of 
the  A.F.A.  Form  "C"  ballot  measures  may  be  predicted  on 
the  basis  of  the  rater  variables. 
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CHAPTER   I 
STATEMENT  OF  THE  PROBLEM 

Intercollegiate  debate  operates  under  a  format  of 
rigidly  limited,  alternating  speeches  by  opposing 
two-man  teams.   The  teams  uphold  opposite  sides  of  a 
specified  resolution.   A  critic  judge  renders  a  decision 
in  favor  of  tiie  team  which  has  done  the  better  debating. 
The  process  is  not  intended  to  discover  "truth"  in  the 
resolution.   Rather  it  is  designed  as  a  neutral  framework 
within  which  the  debaters  may  display  their  argumentative 
skills. 

The  purpose  of  this  study  was  to  test  the  neutrality 
of  the  debate  format ,  and  to  determine  the  influence  of 
certain  non-ability  variables  on  the  decisions  rendered 
by  debate  judges.   The  procedure  for  the  study  was  a 
multi-variate  regression  analysis  of  data  obtained  from 
A.F.A.  Form  "C"  debate  ballots. 

There  seems  to  be  ample  basis  for  the  statement 
that  intercollegiate  forensics  faces  widespread  criticism 
from  various  positions  within  the  field  of  speech. 
Certainly  the  proceedings  at  the  Chicago  convention  of 
the  Speech  Communication  A.ssociation  and  the  AmericcUi 
Forensics  Association  support  this  observation.    In  the 


face  of  such  attacks,  members  of  the  forensics  profession 
have  responded  vigorously.   Numerous  articles  in  scholarly 
journals  as  v/ell  as  convention  programs  and",  addresses 

have  defended  the  legitimacy  of  forsnsics  withir:  the 

■  ■  ■  '\  .  2 
discipline  of  speech  and  its  value  in  higher  educarion. 

A  survey  of  those  efforts  at  defense,  however,  reveals 

that  they  all  assume  the  validity  of  the  cun-ent  practices 

with  forensics.   Only  occasionally  does  research  focus 

on  specific  problems  within  intercoilegiixe  forerisics, 

and  when  such  focus  does  occur,  the  studies  are  limited 

in  scope  and  in  application. 

Intercollegiate  forensics  in  the  19?0's  is  a  major 

academic  activity.   Vast  numbers  of  students  and  faculty 

personnel  are ' involved.   Large  amounts  of  mohey  are  expended 

to  travel  hundreds  of  thousands  of  miles  to  participate  in 

tournaments.   In  1972-73  there  were  over  300  different 

tournaments  held.   In  view  of  this,  one  would  assume  that 

the  basic  format  under  which  all  of  this  activity  was  " 

held  would  have  been  the  object  of  extensive  study.   It 

seems  reasonable  to  expect,  but  in  fact  that  is  not  the 

case.   Research  in  forensics  has  tended  for  the  most  part 

3       . 

to  concentrate  on  demographic  surveys  and  bitter  squabbles 

M- 
over  technical  points  of  argumentation.    While  not 

denying  the  worthiness  of  such  studies,  it  should  be 

pointed  out  that  they  do  not  deal  with  any  question  of 


validity  of  the  process  itself.   They  assume  that  the 
process  is  working  to  achieve  the  desired  results. 
Whether  warranted  or  not,  such  an  assumption  needs  to 
be  tested. 

It  seems  appropriate  at  this  point  to  make  some 
mention  of  the  vast  body  of  published,  and  graduate  research 
annually  reported  in  the  American  Forensics  Association 
bibliographies.   The  bulk  of  such  work  consists,  in  fact, 
of  historico-critical  studies  of  public  and  political 
debates  and  debaters.   The  next  largest  catagory  consists 
of  essays  on  the  theory  of  argumentation.   Only  a  small 
percentage  of  the  research  reported  each  year  is  of  a 

quantitative  nature,  and  only  a  minor  portion  of  that 

.5 
deals  with  intercollegiate  forensics. 

There  is  evidence  to  suggest  that  this  breakdown 

adequately  describes  the  entire  body  of  forensics  research. 

An  analysis  and  summary  of  graduate  research  in  debate 

g 
through  1950  reported  eighty-six  various  theses  and 

dissertations.   33%  of  these  were  historico-critical 

studies  or  political  debates  and  debaters.   An  additional 

18%  consisted  of  histories  of  debating  and  of  debate 

societies  at  various  institutions.   29%  were  descriptive 

studies  of  debate,  debate  texts,  and  debaters.   Two 

studies  dealt  with  the  philosophy  of  argumentation,  and 

five  dealt  with  the  personality  traits  of  debaters  — 

primarily  with  critical  thinking  ability.   Only  one 


graduate  study  prior  to  1950  attempted  to  measure,  even 
indirectly,  the  balance  of  the  traditional  debate  format. 
Cromwell  in  19H9  examined  order  effect  and  discovered 

no  difference  in  audience  effectiveness  attributable  to 

7 

the  order  in  which  two  opposing  speeches  were  presented. 

The  most  sizable  group  of  studies  which  attempt  to 
identify  and  analyze  variables  effecting  the  outcome  of 
intercollegiate  debates  focus  on  ability  variables. 
These  may  be  defined  as  those  factors  commonly  recognized 
to  be  legitimate  bases  for  the  awarding  of  decisions  in 
competitive  debates.   They  include  analysis,  argument, 
reasoning,  case  and  organization,  evidence,  refutation, 
and  delivery ;  the  exact  list  being  dependent  on  the 
actual  ballot  in  use. 

Past  studies  have  utilized  survey  sampling  to  determine 
which  of  these  ci-iteria  were  weighed  most  heavily  by 

o 

judges  in  making  their  decisions.   The  use  of  evidence  in 
debate  has  long  been  an  object  of  study.   Several 

researchers  have  considered  the  role  of  evidence  in  the 

9  ... 

theory  of  argumentation.   The  use  of  evidence  in  inter- 
collegiate debate  has  been  examined  by  a  number  of 
researchers.   Studies  dealt  with  such  factors  as  frequency 

of  use,  types  of  evidence  used,  and  accuracy  in  the 

10 
citation  of  evidence. 

The  relationship,  if  any,  between  debate  and  critical 

thinking  ability  has  received  v/ide  study,  with  various 


conflicting  results,   Phillips,  in  196  2,  could  find  no 
evidence  that  participation  in  debatie  actually  improved 

critical  thinking  ability.    He  explained  differences 

12 
between  his  findings  and  those  of  Howell  and  of  Brembeck 

as  caused  by  the  fact  that  intercollegiate  debaters  tend 

initially  to  rank  above  the  average  in  critical  thinking 

ability.   Phillips'  findings  jwere  disputed  by  Faules, 

whose  196  7  study  argued  that  skill  in  refutation  and 

critical  thinking  ability  were  manifestations  of  the  same 

.13 
trait. 

The  influence  of  refutation  skills  on  success  in 

14 
debate  was  the  subject  of  study  for  Keeling  in  195  9. 

He  discovered  evidence  to  suggest  that  refutation  skill 

might  well  be  sufficiently  significant  to  serve  as  a 

predictor  for  debate  effectiveness. 

This  finding  was  generally  supported  by  the  results 
obtained  by  Wise  in  19  71.    Through  examination  of 
debate  ballots,  V/ise  discovered  various  relationships 
between  scores  on  selected  ballot  traits  and  win-lose 
results  for  a  given  debate  round.   Specifically,  the 
category  of  "analysis"  for  negative  speakers  and  that  of 
"refutation"  for  affirmative  speakers  was   found  to  be 
crucial  in  determining  win-lose. 

An  interesting  extension  of  his  examination  by  Wise 
revealed,  first,  that  affirmative  speakers  tend  to  score 
higher  than  their  negative  opponents,  even  though  the 


win-lose  record  is  balanced  between  affirmative  and 

16 
negative.    Wise  explained  this  difference  on  the  basis 

of  consistently  higher  affirmative  scores  on  the  ballot 

categories  of  "organization"  and  "delivery."   In  his 

sample,  this  held  true  for  both  winning  affirmatives 

compared  to  winning  negatives  as  well  as  losing  affirmatives 

compared  to  losing  negatives. 

These  data  suggested  the  need  for  further  study  to 
determine  whether  or  not  the  present  rating  system  does  in 
fact  provide  a  balanced  opportunity  for  negative  as  v/ell 
as  affirmative  speakers.   If  the  trend  reported  by  VJise 
should  be  substantiated,  it  would  clearly  establish  the 
necessity  for  some  revision  of  the  present  system  where 
each  speaker,  affirmative  and  negative,  is  scored  on  a 
5-30  point  scale. 

In  addition  to  those  descriptive  studies  which  set 
out  to  categorize  debaters  in  terms  of  certain  personality 
traits ,  there  have  been  a  small  number  of  studies  which 
have  attempted  to  test  statistically  the  relationship 
between  certain  traits  and  success  in  debate.   Two  in 
particular  are  worth  mentioning.   Allen,  in  196  3,  studied 

the  variables  of  interpersonal  and  concept  compatibility 

17 
as  they  related  to  the  success  of  a  debate  team.    He 

reported  that  interpersonal  compatibility  between  members 

of  a  debate  team  resulted  in  more  efficient  as  well  as 

more  effective  case  construction,  while  concept 


compatibility  resulted  only  in  greater  efficiency.   These 
data  did  not  reveal  that  student  satisfaction  with  the 
debate  experience  was  any  greater  as  a  result  of  pairing 
with  a  compatible  colleague, 

Willmington,  in  1969,  tested  the  relationships 

betv;een  knowledge  of  debate  theory,  amount  of  previous 

18 

debate  experience,  and  debate  effectiveness.    His 

results  indicated  that  the  only  significant  predictor  of 
success  would  be  amount  of  previous  debate  experience. 

V/hile  each  of  the  studies  cited  above  provides  worth- 
while data,  it  should  be  clear  at  this  point  that  none  of 
them  directly  approached  the  study  of  the  internal  validity 
of  the  debate  process.   Only  Wise  provides  any  evidence 
as  to  the  adequacy  of  balance  in  the  currently  used 
10-5  format,  and  this  was  only  a  minor  and  indirect  result 
of  V/ise's  study.   No  evidence  has  yet  been  presented  on 
the  issue  of  judge  reliability,  or  even  of  judging  bias. 
One  might  infer  from  this  negative  evidence  that  the 
persons  who  are  involved  in  debate  are  relatively  well 
satisfied  with  the  way  in  which  the  activity  is  currently 
structured.   This  is  simply  not  the  case.   A  number  of 
studies  suggest  that  coaches  as  well  as  student  debaters 
frequently  question  the  accuracy  of  decisions  rendered 
by  the  judge  of  a  particular  round.   Kruger  and  Hufford 

have  examined  the  question  of  the  consistency  of  decisions 

19 
across  judges.    Two  researchers  reported  results  which 


might  lead  to  methods  for  checking  the  accuracy  with  which 

20  . 

judges  determine  win-lose  decisions.    Other  studies  have 

focused  on  the  qualifications  of  judges  as  well  as  the 

21 
methods  by  which  they  arrived  at  decisions. 

All  of  these  studies  support  the  assertion  that 

doubts  exist  concerning  debate  judging.   Recently  two 

studies  have  attempted  to  resolve  those  doubts.   Brooks, 

in  1971,  focused  on  the  phenomenon  of  geographical  distance 

22 

bias  in  debate  decisions.    As  Brooks  explained,  the 

proximity  of  the  judge's  school  to  their  opponent's 
school  is  an  excuse  frequently  offered  by  losing  debaters. 
The  study  reported  plotted  the  results  of  six  mid-western 
debate  tournaments  in  terms  of  wins  by  nearest  or  farthest 
teams.   At  least  a  10%  difference  in  mileage  was  established 
as  the  minimum  definition  of  difference  in  distance,  and 
those  teams  not  meeting  this  definition  were  defined  as 
equidistant  from  the  judge's  school  and  excluded  from  this 
study.   Using  a  Chi-Square  statistic  with  a  .05  level  of 
confidence,  Brooks  found  that  in  5  of  the  6  tournaments 

studied,  the  team  geographically  nearest  to  the  judge's 

24 
school  won  the  most  debates.     Brooks'  results  are 

extremely  significant  in  that  they  suggest  that  tournament 

evaluation  may  not  be  strongly  related  to  the  stated 

criteria  under  which  decisions  are  supposed  to  be  made. 


However,  this  study  does  not  extend  far  enough. 
First,  it  fails  to  consider  the  possibility  of  effects 
due  to  social  distance.   In  intercollegiate  debate, 
social  distance  v;ould  manifest  itself  as  a  prestige  factor 
generated  by  the  quality  of  the  debating  performed,  and 
the  academic  and  forensics  reputations  of  the  schools 
represented.   Also  involved  would  be  the  experience  and 
reputation  of  the  individual  students  and  judge  present. 

The  division  of  the  country  into  various  districts 
for  purposes  of  selection  to  the  National  Debate  Tournament 
introduces  another  type  of  "distance"  which  Brooks  does 
not  consider.   Schools  normally  do  a  major  portion  of  their 
season's  debating  within  their  N.D.T.  district,  thus 
potentially  fostering  "friendship  through  propinquity" 
as  Brooks  suggests.   However,  competition  for  N.D.T. 
invitations  is  quite  intense,  and  rivalries  are  established 
which  might  well  over-ride  such  friendship.   All  of  these 
points  suggest  the  need  for  further  examination  of  the 
variable  of  distance  as  it  affects  the  outcome  of 
intercollegiate  debates. 

The  second  study  of  interest  was  reported  by  Hayes 

25 
and  McAdoo  in  187 2.    Stimulated  by  the  work  of  Hensley 

and  Strother,^^  Hayes  and  McAdoo  set  out  to  examine  the 

variable  of  sex  as  it  affected  evaluation  in  intercollegiate 

debate.   The  study  utilized  data  generated  by  speaker 

rankings  rather  than  the  win-lose  results  employed  by 


10 


Hensley  and  Strother  in  an  attempt  to  isolate  individual 
rather  than  team  evaluations.   A  Chi-Square  test  found 
significant  deviation  from  expected  results  at  a  .01 
level  of  confidence,  and  the  direction  of  the  results 
indicated  that  female  debaters  in  the  sample  were 
evaluated  more  highly  than  were  male  debaters. 

There  are  a  number  of  limiting  factors  in  the  work 
reported  by  Hayes  and  McAdoo.   First  was  the  method  by 
which  they  collected  their  sample.   The  ballots  studied 
included  all  of  those  accumulated  by  three  different 
college  debate  programs  over  a  three-year  span.   Obviously 
this  resulted  in  multiple  measures  of  the  seime  female  and 
male  debaters.   While  this  procedure  may  give  conclusive 
evidence  of  the  superiority  of  female  debaters  at  the 
schools  involved,  there  is  no  basis  for  generalization  to 
the  entire  population  of  female  debaters.   Second,  the 
experimenters  excluded  from  the  data  pool  all  ballots  in 
vvhich  the  competition  consisted  of  all  men  or  all  women. 
Thus  the  question  of  interaction  between  male  and  female 
debaters  is  raised.   Finally,  the  study  made  no  distinctions 
as  to  the  sex  of  the  judge,  thus  allowing  forthe  possibility 
of  yet  another  contaminating  variable;  i.e.   interaction 
between  the  sex  of  the  judge  and  sex  of  the  debater 
evaluated. 

On  the  basis  of  the  studies  examined  above,  it  is 
suggested  that  there  is  a  need  for  systematic  examination 
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into  non-ability  variables  which  may  affect  the  outcome 
of  intercollegiate  debates  and  the  evaluations  received  by 
the  students  participating  in  those  debates. 

An  examination  of  the  A. F. A.  Form  "C"  ballot  indicates 
that  included  among  the  data  provided  by  this  instrument 
is  the  following  descriptive  and  evaluative  information: 

Descriptive: 

1)  The  name  of  the  judge 

2)  The  cux^rent  school  affiliation  of  the  judge 

3)  The  schools  represented  in  the  round 
H)   The  names  of  the  individual  debaters 

Evaluative : 

1)  The  side  (and  school)  which  won  the  round 

2)  Ratings  for  each  of  the  four  debaters  on  a 
1-5  scale  on  6  traits 

3)  Rankings  of  the  four  debaters  from  1st  to 
Uth 

4)  Ratings  for  each  team  on  a  scale  of  1-5 
points 

In  addition  to  the  ability  variables  measured  by  the 
evaluative  data  on  the  ballot,  it  is  possible  to  isolate 
the  following  non-ability  variables. 

Variable  ^1  -  Side  of  the  topic 

The  format  of  intercollegiate  debate  is  deliberately 

designed  to  eliminate  any  advantage  accruing  to  either  the 

27     . 

affirmative  or  negative  side  within  the  debate.    While 

the  affirmative  carries  the  burden  of  proof,  it  is  given 
■Che  right  to  select  the  specific  areas  in  which  to  argue 
the  debate.  The  negative  enjoys  the  presumption,. but  is 
required  to  respond  directly  to  the  affirmative  analysis. 
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The  affirmative  is  given  the  first  and  last  speeches,  but 
the  negative  has  a  fifteen-minute  block  in  the  middle  of 
the  debate.   Actual  speaking  time  is  balanced  between 
the  two  sides.   Normally,  teams  "switch  sides"  from  affirma- 
tive to  negative  and  back  in  the  course  of  a  tournament, 
but  an  equal  number  of  affirmative  and  negative  rounds  is 
not  required.   Tournament  team  awards  are  based  on  total 
win-loss  records,  and  individual  awards  on  cumulative  speaker 
points,  with  no  consideration  as  to  the  side  of  the  topic 
on  which  the  team  or  student  debated.   All  of  these  prac- 
tices depend  for  their  validity  on  the  assumption  that  there- 
is  no  inherent  advantage  in  either  the  affirmative  or  nega- 
tive side.   Yet  systematic  research  on  the  point  is  not 
available.   Some  test  of  the  assumption  of  balance  between 
sides  seems,  thei-efore,  to  be  desirable.   In  order  to  examine 
the  assumption,  the  following  research  hypotheses  will  be 
utilized: 

1.1  There  is  no  effect  due  to  side  in  the  win-loss 
decisions  of  intercollegiate  debate. 

1.2  There  is  no  effect  due  to  side  in  the  individual 
speaker  ratings  in  intercollegiate  debates. 

1.3  There  is  no  effect  due  to  side  in  the  team 
ratings  in  intercollegiate  debates. 

Variable  §2    -   Speaker  Position 

While  the  theory  of  competitive  debate  assigns  to 
each  speaker  within  the  debate  specific  duties  and 
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responsibilities,  there  is  no  consensus  that  any  of 
these  duties  is  significantly  more  difficult  than  any 
other,   V/hile  the  first  affirmative  constructive  speech 
is  prepared  in  advance,  thus  freeing  that  debater  from 
the  demands  of  extemporaneous  adaptation,  the  first 
affirmative  rebuttal  speech  is  generally  considered  to  be 
the  most  difficult  of  Ihe  eight  speeches.   Again,  the 
format  seeks  balance  between  the  speakers.   On  the 
assumption  that  this  balance  is  achieved,  all  four 
speakers  are  rated  on  the  same  5-30  point  scale,  and  are 
compared  to  each  other  in  a  forced  choice  ranking  from  1st 
to  4th.   Moreover,  individual  speaker  awards  at  tournaments 
are  made  from  the  entire  pool  of  participants,  with  no 
consideration  given  to  the  order  in  which  the  students 
spoke.   Previously  cited  research  by  Wise  raised  questions 
against  the  assumed  balance  between  speaker  positions. 
In  order  to  validate  that  assumption,  the  following 
research  hypotheses  will  be  tested: 

2,1  There  is  no  effect  due  to  speaker  position 
in  the  rating  scores  of  affirmative 
debaters . 

2.2.  There  is  no  effect  due  to  speaker  position 
in  the  rating  scores  of  negative  debaters. 

2.3  There  is  no  effect  due  to  speaker  position 
in  the  rankings  of  affirmative  debaters* 
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2.4  There  is  no  effect  due  to  speaker  position 
in  the  rankings  of  negative  debaters. 

Variable  #3  -  Sex 

VJith  the  growth  of  co-education  in  institutions  of 
higher  learning,  women  have  participated  more  and  more 
in  debate  against  men.   The  tournament  with  separate 
men's  and  women's  divisions  is  rapidly  disappearing. 
Women  have  participated  in  the  National  Debate  Tournament 
and  have  compiled  outstanding  records.   Some  of  the 
outstanding  coaches  in  the  country  are  women.   Women  have 
served  as  president  of  the  major  forensics  honoraries, 
and  play  an  active  part  in  the  professional  societies. 
With  this  background,  one  would  have  no  reason  to  believe 
that  sex  might  serve  as  a  variable  affecting  the  outcome 
of  intercollegiate  debates.   Yet  many  coaches  do  hold  such 
a  belief.   Hayes  and  McAdoo  reported  that  "Many  debate 

coaches  throughout  the  country  hold  the  belief  that  male 

28 
debaters  are  superior  to  the  female  debater."    Hensley 

and  Strother  reported  results  indicating  that  sex  does 

affect  win-loss  decisions,  with  a  mixed  (i.e.  one  male, 

one  female)  team  standing  a  "greater  than  random  chance 

of  winning."    In  light  of  these  facts,  further  research 

seems  necessary.   Accordingly,  the  following  research 

hypotheses  will  be  tested: 
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3.1  There  is  no  effect  due  to  sex  in  win-loss 
decisions  in  intercollegiate  debates. 

3.2  There  is  no  effect  due  to  sex  in  speaker  ratings 
in  intercollegiate  debates. 

3.3.   There  is  no  effect  due  to  sex  in  speaker 
rankings  in  intercollegiate  debate. 

3.1+  There  is  no  effect  due  to  the  sex  of  the 
judge  in  win-loss  decisions  in  inter- 
collegiate debates. 

3.5  There  is  no  effect  due . to  the  sex  of  the 
judge  in  speaker  ratings  assigned  in 
intercollegiate  debates. 

3.6  There  is  no  effect  due  to  the  sex  of 
the  judge  in  speaker  rankings  assigned 
in  intercollegiate  debates. 

3.7  There  is  no  effect  due  to  an  interaction 
between  the  sex  of  the  judge  and  the  sex  of 
the  debaters  in  the  outcome  of  intercollegiate 
debates . 

Variable  ^U  -  Prestige 

The  subjective  nature  of  decision-making  in  inter- 
collegiate debate  has  frequently  led  to  questions 
concerning  the  impact  of  prestige  in  determining  those 
decisions.   The  tournament  circuit  generally  results  in 
frequent  contacts  between  teams,  and  a  widespread 
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knowledge  of  the  past  successes  of  any  team  or  teams. 
Do  judges  favor  certain  teams  over  others?   Or,  perhaps, 
teams  from  certain  schools?   The  problem  is  compounded 
by  the  fact  that  prestige  tends  to  be  generated  in 
forensics  circles  by  success,  and  is  not  necessarily 
related  to  the  general  academic  reputation  of  an 

institution  nor  to  the  grade  point  average  of  the  individual 

30 
debater.    How  much  of  a  debater's  success  is  attributable 

to  his  ability,  and  how  much  to  the  prior  reputation  with 

the  judgeXs)?  Hov;  much  of  the  success  which  teams  from 

a  given  school  achieve  year  after  year  is  due  to  an 

outstanding  forensics  program  turning  out  superior  teams? 

And  how  much  must  be  credited  to  the  "mantle  of  greatness" 

which  any  one  team  enjoys  because  of  the  achievements  of 

past  teams?  The  fact  that  these  are  difficult  questions 

to  answer  does  not  mean  that  they  should  be  avoided. 

The  first  step  in  searching  for  those  answers  requires 

that  some  definition  of  prestigeous  teams  be  formulated. 

For  purposes  of  this  study,  "high  prestige"  teams  will  be 

defined  as  those  representing  an  institution  which  attended 

the  National  Debate  Tournament  either  in  the  current  year 

or  the  year  immediately  preceding  the  one  in  which  the 

debate  studies  was  held.   "Low  prestige"  teams  will  include 

all  other  teams  representing  four-year  institutions. 

Junior  Colleges  will  be  considered  in  a  separate  category 
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because  of  the  widespread  assumption  that  debaters  from 
junior  colleges  have  less  ability  than  those  from  senior 
institutions. 

In  order  to  equalize  ability  of  the  debaters  included 
in  the  sample  insofar  as  possible,  the  hypotheses  testing 
variable  #4  will  be  tested  using  only  data  from  elimination 
rounds.   Because  teams  selected  to  participate  in 
elimination  rounds  must  qualify  in  multi-round  preliminaries, 
and  because  of  the  practice  of  power-matching  teams  in 
those  preliminary  rounds,  limiting  the  data  pool  to 
elimination  round  ballots  will  remove  from  consideration 
clearly  inferior  teams.   The  over-all  variance  in  team 
and  individual  ability  should  be  greatly  reduced  by  such 
a  prccedure,  thus  increasing  the  reliability  of  whatever 
test  results  are  obtained.   The  research  hypotheses  to  be 
employed  are: 

i+.l  There  is  no  effect  due  to  prestige  in  win-loss 

results  of  debates  between  senior  colleges. 
4.2  There  is  no  effect  due  to  prestige  in  the 

win-loss  results  of  debates  between 

"high  px'estige"  colleges  and  junior 

colleges. 
4.3.  There  is  no  effect  due  to  prestige  in  the 

win-loss  results  of  debates  between  "low 

prestige"  colleges  and  junior  colleges. 
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Variable  #5  —  Proximity 

As  was  discussed  above,  geographical  proximity  has 
frequently  been  suggested  as  a  variable  influencing  the 
outcome  of  intercollegiate  debates.   Brooks  offers 
evidence  to  suggest  that  judges  vote  more  often  in  favor 

of  teams  from  schools  nearest  to  their  own  than  they  do 

31 
for  teams  from  schools  further  away.    Such  a  trend, 

if  confirmed,  represents  a  serious  failing  in  the  present 

judging  system,  and  therefore  deserves  further  examination. 

An  additional  measure  of  proximity  might  be  provided 

by  considering  the  nine  N.D.T.  districts  into  which  the 

nation  is  divided.   Because  these  district  lines  tend  to 

represent  natural  lines  of  travel  and  traditional 

rivalries,  the  effects  due  to  simple  geographical 

proximity  might  well  be  over-ridden  by  the  pressures  of 

district  loyality.   A  judge  from  Mississippi,  for  example, 

belongs  to  N.D.T.  District  VI,  which  includes  Alabama,, 

Tennessee,  Kentucky,  Georgia,  Florida,  and  the  Carolinas. 

Such  a  judge  might  well  feel  more  affiliation  with,  or  be 

swayed  more  by  friendship  for,  a  team  from  North  Carolina 

than  for  a  team  from  Arkansas  or  Louisianna,  in  spite  of 

the  fact  that  he  is  geographically  closer  to  those  teams. 

The  strength  of  district  loyalties  is  demonstrated  by  the 

fact  that  the  National  Debate  Tournament  does  not  allow 

a  judge  to  judge  any  team  from  his  own  district  in 

competition  against  a  team  from  another  district.   The 
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testing  of  variable  #5,  then,  should  allow  for  the 
consideration  both  of  geographical  and  district  proximity. 
The  following  research  hypotheses  have  therefore  been 
framed: 

5.1  In  debates  where  both  teams  and  the  judge 
represent  the  same  N.D.T.  district,  there 
is  no  effect  due  to  geographical  proximity 
in  determining  the  result. 

5.2  In  debates  where  both  teams  represent  one 
N.D.T.  district  and  the  judge  represents 
another  district,  there  is  no  effect  due 
to  geographical  proximity  in  determining 
the  result. 

5.3  In  debates  where  each  team  and  the  judge  represent 
different  N.D^T.  districts,  there  is  no  effect 
due  to  geographical  proximity  in  determining 

the  result. 

5.4  In  debates  where  one  team  and  the  judge  represent 
the  same  N.D.T.  district,  and  the  second  team 
represents  another  district,  there  is  no 

effect  due  to  district  proximity  in 
determining  the  result. 
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CHAPTER   II 
METHODOLOGY 

Sampling  Procedures 

In  order  to  eliminate  possible  effects  due  to  a 
specific  debate  topic,  the  data  V7ere  collected  from 
tournaments  covering  a  five  year  period.^  The  experimental 
samples  were  stratified  between  the  five  years,  and  each 
stratum  was  stratified  between  debates  occuring  in  the 
first  half  of  the  tournament,  those  occuring  in  the  second 
half,  and  those  coming  during  the  elimination  rounds. 
In  both  cases,  the  weights  were  equalized  for  each  stratum. 
This  procedure  was  adopted  to  eliminate  possible  effects 
across  time,  either  between  or  within  tournaments. 

It  was  decided  to  accept  a  boundary  on  the  error  of 

estimation  of  no  more  than  one  point  on  the  5-30  point 

scale  for  speaker  ratings  in  the  estimation  of  sample 

means.   In  order  to  achieve  that  limit,  the  following 

2 
formula  for  determining  the  size  of  the  samples  was  used. 


n2  £_  +   i;  H  a.^ 
U    i=l 
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where:  N.  is  the  total  number  of  debates  in  stratum  i; 

a-      is  the  population  variance  of  stratum  i; 

N  is  the  total  number  of  debates ;  and 

w.  is  the  fraction  of  observations  allocated 
■^  to  stratum  i. 

Since  this  formula  requires  some  estimate  of  population 
variance  for  each  of  the  sample  strata,  a  1  in  20 
systematic  sample  with  N  =  200  was  drawn  from  the  data 
pool  to  provide  those  estimates. 

After  calculation  of  the  variance  for  each  of  the 
strata,  the  appropriate  sample  size  was  computed  to  be 
75.   Samples  for  the  study  were  drawn  from  the  data  pool 
by  a  1  in  30  systematic  sample  until  each  stratum  was  filled. 
The  allocation  of  the  total  sample  across  each  stratum 
was  derived  by  the  formula  n.  =  nw. » 

Dependent  Measures 

Examination  of  the  data  began  with  a  study  of  the  four 
dependent  measures  available  from  the  A.F.A.  Form  "C" 
ballot;  win-loss  decision,  speaker  rating,  team  rating,  and 
speaker  ranking.   Since  all  four  measures  are  reports  of 
the  same  event  (i.e.  one  debate),  the  assumption  was  that 
there  would  be  a  relationship  among  them.   The  exact  form 
of  that  relationship,  and  its  strength,  were  examined  to 
determine  if  a  more  parsimonious  statistical  procedure 
could  be  employed. 
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Specifically,  if  a  strong  positive  correlation  existed 
among  the  variables  speaker  ranking,  speaker  rating,  team 
rating,  and  winning,  the  testing  of  the  research  hypotheses 
might  be  collapsed  to  examine  that  one  of  the  dependent 
measures  which  accounted  for  the  greatest  portion  of  the 
variance.   Inferences  concerning  those  dependent  measures 
not  tested  directly  could  still  be  made  on  the  basis  of  the 
relationships  known  to  exist. 

In  order  to  determine  the  relationship  among  the 
available  measures  of  outcome  for  an  intercollegiate 
debate:   win-loss  decision;  speaker  rankings;  speaker  ratings; 
and  team  ratings;  the  data  were  submitted  to  botli 
correlation  and  regression  analysis. 

To  facilitate  sampling,  the  data  pool  was  arranged 
in  consecutive  order  of  rounds  and  chronological  order  of 
the  tournaments.   Two  samples,  each  with  N  =  200,  were 
selected  from  the  data  pool  through  a  1  in  20  systematic 
sample.   In  instances  where  this  procedure  produced  an 
incomplete  ballot,  the  selection  moved  backwards  one  step 
at  a  time  until  a  complete  ballot  was  encountered.   In 
each  case,  the  progression  continued  from  the  ballot 
actually  included  in  the  sample.   In  no  instance  did  this 
correction  procedure  require  moving  more  than  three 
ballots  away  from  that  indicated  by  the  1  in  20  progression. 
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The  data  from  the  first  sample  were  submitted  to 

3 
sis  by  the  BMD02D  program,   and 

correlation  matrix  shown  in  Table  I. 


3 
analysis  by  the  BMD02D  program,   and  produced  the 


TABLE   I 

Pearson  Product-Moment  Correlations  for  Dependent 
Measures  of  Outcome 


V/inning  Losing    Speaker   Team   Speaker 
Rating   Rating  Ranking 


Winning  1.0000   -1.0000    .38ii2    .2902  -.6243 

Losing  1.0000   -.3842   -.2902  .6243 

Speaker  rating  1.0000    .7592  -.4646 

Team  rating  '  1.0000  --2120 

Speaker  ranking  1.0000 


The  correlation  coefficients  were  tested  for  significant 
deviation  from  zero  using  a  form  of  the  "t"  test  adapted 
for  the  standard  deviation  of  "r" ,  the  estimated  value  of 
the  population  correlation  coefficient.   The  calculated 
value  of  the  test  statistic  "t"  for  each  correlation 
coefficient  and  the  interpretation  associated  with  that 
statistic  are  shown  in  Table  II. 
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TABLE   II 

Tests  for  Significance  of  Pearson 
Product -Moment   Correlations 


Variables 


Win/loss 
Win/Sp.  rating 
Win/ Team  rating 
V;in/  Sp .  ranking 


1] 


t-value 


interpretation 


-1.0000  undefined  significant  with  p<" 
.3842   18, H3    significant  with  p'^.Ol 
.2902   13.93     significant  with  p<.01 
-.6243  -35.46     significant  with  p<.01 


Lose/Sp.  rating 
Lose/team  rating 
Lose/Sp.  ranking 


-.3842  -18.43     significant  with  p<.01 

-.2  902  -13.93    significant  with  p<.Ql 

.6  243   35.46    significant  v;ith  p<,01 


Sp.  rating/team  rating   .7592   51.78    significant  with  p<.01 
Sp.  rating/Sp.  ranking  -.4646  -23.36    significant  with  p<.01 


Team  rating/Sp.  rank    -.2120  -12.66    significant  with  p<. 01 
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The  results  of  this  analysis  conformed  to  the  a 
priori  expectation  of  significant  correlations  based  on 
the  fact  that  all  four  dependent  variables  are  reports  of 
the  same  event.   The  absolute  negative  correlation  between 
variables  1  and  2  was,  of  course,  forced  by  the  mutually 
exclusive  definitions  of  those  two  nominal  variables.   The 
negative  correlations  between  speaker  ratings  and  team 
ratings  and  speaker  rankings  is  explained  by  the  opposing 
directions  of  the  scales  on  which  those  variables  are 
measured.   As  performance  improves,  speaker  and  team 
ratings  move  upward  on  scales  of  5-30  and  1-5  points 
respectively,  whereas  speaker  rank  moves  downward  toward 
1  on  a  scale  of  1  to  4,  - 

The  finding  of  a  significant  relationship  between  the 
four  available  dependent  measures  suggested  the  presence  . 
of  a  principal  component  effect  among  them.   If  true,  that 
principal  component  would  serve  as  a  common  "index  of 
outcome"  which  would  pool  the  information  contained  in  the 
four  dependent  measures.   The  search  for  a  principal  component 
was  begun  using  the  BMDX72  "Factor  Analysis"  program.   The 
data  from  the  second  sample  were  used  for  this  analysis, 
with  N  =  200. 

Factor  analysis  revealed  that  the  measures  of  outcome 
available  from  the  A.F.A.  Form  "C"  ballot  clustered  around 
four  factors  which,  taken  together,  accounted  for.  100%  of 
the  variance  in  the  original  four  dependent  measures. 
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However,  the  proportion  of  variance  accounted  for  was  not 
equally  distributed  among  the  four  factors.   A  principal 
component  was  discovered  which  accounted  for  some  60 
percent  of  the  total  variance.   A  second  factor,  which 
accounted  for  an  additional  25  percent  of  the  variance, 
also  emerged. 

The  output  of  BMDX72  also  included  a  definition  of 
both  factors  in  terms  of  their  linear  relationship  to  the 


four  original  measures. 

follows. 

Factor  one:   yl  = 
Factor  two:   y2  = 


Those  definitions  were  as 


.73x1  -  .37x2  -  .19x3  +  1.30x4 
.52x1  +  .39x2  +  .61x3  +  .49x4 
where  xl  was  a  dichotomous  variable  representing  win-loss; 

x2  was  speaker  rating; 

x3  was  team  rating;  and 

x4  was  speaker  ranking. 
i 
On  the  basis  of  this  analysis,  it  was  concluded  that 

the  use  of  Factor  one  as  the  "index  of  outcome"  would  be 

of  interest.   Because  it  represents  contributions  from  each 

of  the  four  dependent  measures,  the  factor  would  itself 

contain  more  information  than  any  one  of  the  four. 

In  all  subsequent  analyses,  the  data  V7ere  first 

submitted  to  the  BMDX7 2 ! program  in  order  to  generate 

"index  of  outcome"  scores  for  each  case.   Those  scores 

were  then  used  as  an  additional  response  measure  for 

testing  regression  by  the  dependent  variables. 
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Predictor  Variables 

The  method  chosen  for  the  testing  of  the  research 
hypotheses  was  an  adaptation  of  multi-variate  regression 
analysis,  using  "dummy"  variables  to  introduce  nominal  data 
into  the  regression  equation.   Since  the  variables  to  be 
tested  all  lay  outside  the  range  of  "ability"  variables 
as  defined  in  Chapter  I,  any  significant  apportioning 
of  variance  in  the  dependent  measure  to  a  variable  under 
study  was  taken  as  evidence  of  a  non-ability  effect. 

Regression  analysis  was  chosen  as  the  appropriate 
statistical  procedure  because  the  issue  of  importance 
was  the  estimation  of  relationships  among  variables 
influencing  the  outcome  of  intercollegiate  debates 
rather  than  measuring  the  strength  of  those  relationships. 

Actual  calculations  were  performed  by  the  University 
of  Florida  Computing  Center  using  the  BMDX6  3  program 
"Multivariate  General  Linear  Hypothesis"  developed  by 
the  U.C.L.A.  Health  Sciences  Computing  Facility.   The 
output  from  this  program  includes  regression  coefficients 
for  each  of  the  predictor  variables  in  the  model,  various 
cross-product  matrices,  and  appropriate  "F"  statistics 

with  associated  degrees  of  freedom  for  hypotheses 

5 
selected  by  the  experimenter. 

The  procedure  required  a  division  of  the  study  into 

two  steps.   Attention  was  first  given  to  the  format 

variables, "side  of  topic"  and  "speaker  position,"  in 
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order  to  determine  whether  or  not  blocking  for  the  effects 
of  those  variables  might  be  appropriate  when  examining 
the  rater  variables  "sex,"  "prestige,"  and  "proximity." 
The  general  form  of  the  model  tested  was 

7=3^  +  3-lX^  +  .  .  .  +  Bj^Xj^  +  e,  where 

y  was  the  index  of  outcome ; 

6   was  some  constant ; 
o 

X-,  .  .  .X,  were  the  non-ability  variables 
being  studied; 

B-,  .  .  .  B]^  were  the  regression  weights 
associated  with  each  variable ;   and 

£  represented  unapportioned  variance,  or 
error  in  the  model. 

Statistical  Models 

Examination  of  the  research  hypotheses  dealing  with 
the  effects  of  format  bias  on  the  dependent  measure 
index  of  outcome  utilized  the  following  model. 

Model  #1. 

y  =  b  +  b^xj^  +  b2X2  +  ^3X3  +  e,  where 

9  is  the  dependent  variable,  index  of  outcome; 

b  is  the  mean  score  for  2nd  negative  speakers; 
o 

b,  is  the  mean  score  for  1st  affirmative 
speakers ,  minus  b  ; 

bj  is  the  mean  score  for  2nd  affirmative 
speakers,  minus  b  ; 
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b-  is  the  mean  score  for  1st  negative  speakers, 
minus  b^; 

X-,  is  a  dummy  variable  for  1st  affirmatives  (1  if 
1st  affirmative,  0  otherwise); 

X   is  a  dummy  variable  for  2nd  affirmatives  (1  if 
2nd  affirmative,  0  otherwise);  and 

X-  is  a  dummy  variable  for  1st  negatives  (1  if 
1st  negative,  0  otherwise). 

The  model  was  tested  for  "goodness  of  fit"  using 
the  test  statistic 

MS  (Lack  of  Fit)  ^,   ^ 

~  MS  (Experimental  Error)    *p-(k+l),  n-p 

using  the  rejection  region 

F  >  F 

.05,  p-(k+l),  n-p 

Having  verified  that  the  derived  model  did  in  fact 

provide  a  good  fit  for  the  sample  data,' the  hypotheses 

were  tested  specifically  as  follov;s. 

Hypothesis  1.2;  There  is  no  effect  due  to  side  in  the 
individual  speaker  scores  in  intercollegiate  debate. 

"o^  h-    h-   h-   ^ 
H  :  at  least  one  6^  ^  0 

Hypothesis  2.1;   There  is  no  effect  due  to  speaker 
position  in  the  scores  of  affirmative  debaters. 

"o-  ^1  =  °     ^o^  ^2  =  ° 
H^;  6^  /  0     H^;  ^^    i    0 
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Hypothesis  2.2;   There  is  no  effect  due  to  speaker  position 
in  the  scores  of  negative  debaters. 

H  :  3q  =  0 
o   ^ 

11^:  $3  ;5  0 

All  tests  were  made  using  a  preset  value  of  <=  =  .05  and 
the  test  statistic 

.  ..  7 

MSR   p 

^  "  MSE    ^  1^  2 
As  the  examination  of  the  appropriate  research 
hypotheses  provided  no  evidence  of  an  effect  on  the 
outcome  of  debates  due  to  format  bias,  analysis  of  the 
data  was  continued  by  examining  the  effects  due  to  the 
three  rater  bias  variables  "sex,"  "prestige,"  and 
"proximity."   The  examination  of  research  hypothesis 
three,  dealing  with  the  effect  of  the  sex  of  the  debaters 
and  the  judge  on  the  index  of  outcome,  was  based  on  the 
followii-g  model. 

Model  #2. 

5  =  b  +  h^x^   +  h^x^   +  bgXg  +  b^Xj^  +  bsXg  +  e 

where  y  is  the  dependent  variable ,  index  of  outcome; 

b  is  the  mean  score  for  a  male  debater  with  a  male 
°  colleague  evaluated  by  a  male  judge; 

b  is  the  mean  score  for  a  male  debater  with  a 
■'•  female  colleague  evaluated  by  a  male  Dudge, 

minus  b^ ; 
o 
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b-  is  the  mean  score  for  a  male  debater  with  a 
male  colleague  evaluated  by  a  female  judge, 
minus  b  ; 

bo  is  the  mean  score  for  a  male  debater  with  a 
female  colleague  evaluated  by  a  female 
judge,  minus  b^; 

b  is  the  mean  score  for  a  female  debater  with 
a  female  colleague,  evaluated  by  a  male 
judge,  minus  b  ; 

br  is  the  mean  score  for  a  female  debater  with  a 
female  colleague ,  evaluated  by  a  female 
judge,  minus  b  ; 

X.  is  a  dummy  variable  (1  if  male  debater  with 
■"■  female  colleague  and  male  judge,  0  otherwise); 

X2  is  a  dummy  variable  (1  if  male  debater  with 
male  colleague  and  female  judge,  0  otherwise); 

x„  is  a  dummy  variable  (1  if  male  debater  with_ 

female  colleague  and  female  judge,  0  otherwise); 

Xn  is  a  dummy  variable  (1  if  female  debater  with  , 
female  colleague  and  male  judge,  0  otherwise); 
and 

Xr  is  a  dummy  variable  (1  if  female  debater  with 
female  colleague  and  female  judge,  0  otherwise). 

Having  verified  that  the  derived  m.odel  did  in  fact 

provide  a  good  fit  for  the  sample  data,  the  hypotheses 

were  tested  specifically  as  follows.   The  general 

hypothesis  of  some  effect  due  to  sex  was  of  the  form 

%''      h   =  ^2  =  33  =  64  =  65  =  0 

H  :   at  least  one  6^  ^  0. 

In  order  to  isolate  the  specific  item  in  which  an 
effect  due  to  sex  was  to  be  found,  each  of  the  terms  in 
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the  model  was  tested  under  the  general  form 

All  tests  were  made  using  a  preset  value  of  «  =  .05  and 
the  test  statistic 

MSR   ^     ^ 

F  =  F 

MSE    ^1^2 

The  examination  of  research  hypothesis  four, 
dealing  with  the  effects  of  prestige  on  tlie  index  of 
outcome,  was  based  on  the  following  model. 

Model  #3. 

J  =  b^  +  Vl  *  ^2^2  *  ^3^3  ^  ^A  ^  ^5^5  ^^6'^6  * 

b^x^  +  bgX  +  e,  where 

y  is  the  dependent  variable; 

b  is  the  mean  score  for  a  high  prestige  team 
°  debating  a  low  prestige  team  with  a  judge  from 
a  high  prestige  school; 

b,  is  the  mean  score  for  a  high  prestige  team 

debating  a  low  prestige  team  with  a  judge  from 
a  low  prestige  school,  minus  b  ; 

b  is  the  mean  score  for  a  high  prestige  team 
^  debating  a  low  prestige  team  with  a  judge  from 
a  junior  college,  minus  b  ; 

bo  is  the  mean  score  for  a  high  prestige  team 
debating  a  junior  college  with  a  judge  from  a 
high  prestige  school,  minus  b^; 

h^^   is  the  mean  score  for  a  high  prestige  team 
^  debating  a  junior  college  with  a  judge  from  a 
low  prestige  school,  minus  b  ; 
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hr    is  the  mean  score  for  a  high  prestige  team 
^  debating  a  junior  college  with  a  judge  from  a 
junior  college,  minus  ^^i 

b  is  the  mean  score  for  a  low  prestige  team 
6  debating  a  junior  college  with  a  judge  from  a 
high  prestige  school,  minus  b^; 

b^  is  the  mean  score  for  a  low  prestige  team 

debating  a  junior  college  with  a  judge  from  a 
low  prestige  school,  minus  b  ; 

bg  is  the  mean  score  for  a  low  prestige  team 

debating  a  junior  college  with  a  judge  from  a 
junior  college,  minus  b^; 

X-,  is  a  dummy  variable  (1  if  high/ low  with  low 
judge,  0  otherwise); 

Xj  is  a  dummy  variable  (1  if  high/low  with  J.C. 
judge,  0  otherwise); 

X-  is  a  dummy  variable  (1  if  high/j.c.  with 
high  judge,  0  otherwise); 

x^  is  a  dummy  variable  (1  if  high/j.c.  with  low 
judge,  0  otherwise); 

X  is  a  dummy  variable  ,(1  ;if  high/j.c.  with  j.c. 
judge,  0  otherwise); 

Xg  is  a  dummy  variable  (1  if  low/j.c.  with  high 
judge,  0  otherwise); 

x^  is  a  dummy  variable  (1  if  low/j.c.  with  low 
judge,  0  otherwise);   and 

Xg  is  a  dummy  variable  (1  if  low/j.c.  with  j.c. 
judge,  0  otherwise). 

Having  verified  that  the  derived  model  did  in  fact 

provide  a  good  fit  for  the  sample  data,  the  hypotheses 

were  tested  specifically  as  follows.   The  general 

hypothesis  of  some  effect  due  to  prestige  was  of  the 

form 
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H^:  at  least  one  3-  ^  0. 
a  1 

In  order  to  isolate  the  specific  item  in  which  an 

effect  due  to  prestige  was  to  be  found,  each  of  the 

terms  in  the  model  was  tested  under  the  general  form 

H  :  B.  =  0 
o    1 

H  :  3.  3«  0. 

All  tests  were  made  using  a  preset  value  of  «  .05  and  the 

test  statistic 

MSR  9 

F  = F 

MSE     V  i^  2 

The  examination  of  research  hypothesis  five,  dealing 
with  the  effects  of  "proximity"  on  the  index  of  outcome, 
was  based  on  the  following  model. 
Model  #4. 

byX_  +  e,  where 

y  is  the  dependent  variable; 

b  is  the  mean  score  for  a  debater,  with  opponents 
°  and  judge  from  his  own  district,  who  is  nearest 
to  the  judge; 

b,  is  the  mean  score  for  a  debater,  with  opponents 
and  judge  from  his  own  district,  who  is  farthest 
from  the  judge,  minus  b^; 

b,  is  the  mean  score  for  a  debater,  with  opponents 
from  his  own  district  and  a  judge  from  another 
district,  who  is  nearest  to  the  judge,  minus  b^; 
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bo  is  the  mean  score  for  a  debater,  with  opponents 
from  his  own  district  and  a  judge  from  another 
district,  who  is  farthest  from  the  judge,  minus  b^; 

b^  is  the  mean  score  for  a  debater,  with  opponents 
and  the  judge  each  from  different  districts,  who 
is  nearest  to  the  judge,  minus  b^; 

br  is  the  mean  score  for  a  debater,  with  opponents 
and  the  judge  each  from  different  districts,  who 
is  farthest  from  the  judge,  minus  b^; 

be  is  the  mean  score  for  a  debater  with  a  judge  from 
his  own  district  and  opponents  from  another 
district,  minus  b^; 

b  is  the  mean  score  for  a  debater  with  opponents 
"^  from  another  district  and  the  judge  from  his 
opponent's  district,  minus  b^; 

X,  is  a  dummy  variable  (1  if  of  the  conditions  for 
bi,  0  otherwise) ; 

Xo  is  a  dummy  variable  (1  if  of  the  conditions  for 
b2,  0  otherwise); 

x,  is  a  dummy  variable  (1  if  of  the  conditions  for 
bo,  0  otherwise); 

x,.  is  a  dummy  variable  (1  if  of  the  conditions  for 
b^,  0  otherwise); 

Xr  is  a  dummy  variable  (1  if  of  the  conditions  for 
br,  0  otherwise); 

Xg  is  a  dummy  variable  (1  if  of  the  conditions  for 
b_  ,  0  otherwise); 

X-  is  a  dummy  variable  (1  if  of  the  conditions  for 
by,  0  otherwise). 

Having  verified  that  the  derived  model  did  in 

fact  provide  a  good  fit  for  the  sample  data,  the  hypotheses 

were  tested  specifically  as  follows.   The  general 

hypothesis  of  some  effect  due  to  proximity  was  of  the 

form 
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H  :  at  least  one  3,'  ^  0 . 
a  J. 

In  order  to  isolate  the  specific  item  in  which  an 
effect  due  to  prestige  was  to  be  found,  each  of  the 
terms  in  the  model  was  tested  under  the  general  form 

H  :  3.  =  0 
o   1 

H^:  3i  ^  0. 

All  tests  were  made  using  a  preset/ value  of  01  =  .05  and 
the  test  statistic 

MSE     "l«2 


NOTES  -  CHAPTER  II 

The  years  covered  were  the  academic  years  196  7- 
68,  1968-59,  1969-70,  1970-71,  and  1971-72.   The 
tournaments  involved  were  the  Peachtree  Debate 
Tournament,  hosted  by  Emory  University  in  Atlanta, 
Georgia;  the  Birmingham  Invitational  Debate  Tournament , 
hosted  by  Samford  University  in  Birmingham,  Alabama; 
and  the  Gator  Invitational  Debate  Tournament,  hosted  by 
the  University  of  Florida  in  Gainesville,  Florida. 

2    .   • 

V/illiam  Mendenhall,  Lyman  Ott ,  and  Richard 

Schaeffer,  Elementary  Survey  Sampling  (Belmont, 
California:   Wadsworth  Publishing  Company,  Inc.,  1971), 
p.  61. 

3 

For  a  description  of  this  program,  see  W.J. 

Dixon, BMP  Biomedical  Computer  Programs  (University  of 

California  Publications  m  Automatic  Computation  No.  2, 

1972),  pp.  49-59. 

The  test  statistic  took  the  form: 


/1-rij 


2    /n-2 


For  a  discussion  of  this  test,  see  William  Hayes, 
Statistics  (New  York:  Holt,  Rinehart  and  Winston,  1968), 
pp.  529-533. 

^   For  a  description  of  this  program,  see  W.J. 
Dixon ,  BMD  Biomedical  Computer  Programs :  X-series 
Supplement  (University  of  California  Publications  in 
Automatic  Computation  No.  3,  1972),  pp.  90-103. 

c 

For  a  more  complete  description  of  this  program, 
see  Dixon,  X-series  Supplement,  pp.  2  3-33. 

n 

Actual  calculation  of  these  values  was  performed 
through  the  BMDX6  3  program. 

Actual  calculation  of  these  values  was  performed 
through  the  BMDX63  program. 
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Actual  calculation  of  these  values  was  performed 
through  the  BMDX6  3  program. 

10   Actual  calculation  of  these  values  was  performed 
through  the  BMDX6  3  program. 


CHAPTER   III 
RESULTS 

The  format  variables  "side  of  topic"  and  "speaker 
position"  were  the  first  to  be  studied,  using  Model  #1 
as  described  in  Chapter  II.   Examination  of  comparative 
wins  for  affirmative  and  negative  debate  teams  indicated 
no  significant  difference  in  outcome  due  to  the  side 
of  the  topic.   The  appropriate  regression  weights  for 
Model  #1,  as  calculated  by  BMDX63,  and  the  results  of 
individual  tests  for  significant  regression,  are  shown 
in  Table  III. 


TABLE  III 
Regression  Weights  for  Format  Variables 

Variable    Weight   F-score     d.f.    Interpretation 


xl 

.068 

0.009 

1,76 

p  >  .05 

x2 

.303 

0.179 

1,76 

p  >  .05 

x3 

.913 

1.629 

1,76 

p  >  .05 
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The  data  provide  no  evidence  of  an  effect  on  the 
outcome  of  intercollegiate  debates  due  to  the  side  of 
topic  or  speaker  positions.   For  this  reason,  analysis 
of  the  rater  bias  variables  did  not  employ  blocking 
along  format  variables. 

Further  analysis  of  these  data  focused  on  the 
regression  effects  of  side  of  topic  and  speaker  position 
on  each  of  the  four  dependent  measures  provided  by  the 
A.F.A.  Form  "C"  ballot.   Only  in  one  instance  did  the 
results  indicate  a  significant  effect.   The  calculated 
regression  weights  for  each  variable  and  the  F  statistic 
associated  with  each  are  shown  in  Table  IV. 

The  presence  of  regression  on  speeJcer  rank  indicates 
that  the  appropriate  formula  for  predicting  an  individuals 
speeiker  rank  would  be 

y  =  2.0  +  .75x,   where 

y  is  the  expected  rank  and  x  is  a  dichotomous  variable 
representing  the  second  affirmative  speaker. 

Blocking  across  time  within  a  tournament  also 
produced  significant  differences.   The  frequency  of  wins 
in  preliminary  rounds  appeared  to  differ  from  that  in 
elimination  rounds,  with  a  calculated  value  of  F 
significant  v/ith  p  <  .05.   A  Chi-square  analysis  of  the 
data  to  determine  exact  differences  revealed  that,  when 
considering  only  preliminary  rounds ,  negative  teeims  won 
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TABLE   IV 

Regression  Weights  for  Format  Variables  On 
Ballot  Measures 


Win-loss 

Variable 

Weight 

F-score 

d.f. 

'   xl 
x2 
x3 

-0.30 

-0.30 

0.00 

3.76 
3.76 
0.00 

1,76 
1,76 
1,76 

Speaker  Rating 

xl 
x2 
x3 

-2.25 
-1.70 
.0.80 

2.54 
1.45 
0.32 

1,75 
1,76 
1,76 

Team  Rating 

xl 
x2 
x3 

-0.15 

-0.15 

0.00 

0.28 
0.28 
0.00 

1,76 
1,76 
1,76 

Speaker  Rank 

xl 
x2 
x3 

.70 
.75 
.45 

3.92 
4.51 
1.62 

1,76 

1,76* 

1,76 

*  significant  with  p  <  .05 
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more  debates,   Chi-square  for  this  comparison  was 
significant  with  p  <  .10.   Negative  teams  also  won  more 
elimination  round  debates,  but  the,  probability  of  error 
in  this  statistic  exceeded  .10.   The  percentages  by  which 
negative  v/ins  exceeded  expected  frequencies  for  both 
preliminary  and  elimination  rounds  are  shown  in  Table  V. 


TABLE  V 
Percentages  of  Excessive  Negative  Wins 


Preliminary 
Rounds 

Elimination 
Rounds 

1967-68 

-  50% 

100% 

1968-69 

40% 

25% 

1969-70 

40% 

25% 

1970-71 

25% 

25% 

1971-72 

40% 

25% 

Totals  27%  68% 


Sex 

The  effect  of  the  rater  variable  "sex"  on  the  index 
of  outcome  was  examined  with  Model  #2,  as  described  in 
Chapter  II.   The  predictors  for  this  analysis  were 
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dichotomous  variables  representing  the  nine  possible 
combinations  of  sex  when  considering  a  debater,  his 
colleague,  and  the  judge.   The  calculated  regression 
weights,  F-scores,  and  interpretations  for  this  model 
are  shown  in  Table  VI . 


TABLE  VI 
Regression  Weights  for  Rater  Variable  Sex 


Variable 

Weight 

F-score 

d.f. 

Interpretation 

xl 

-0.75 

0.72 

1,92 

P 

> 

.05 

x2 

-1.03 

0.6  8 

1,92 

P 

> 

.05 

x3 

-0.26 

0.05 

1,92 

P 

> 

.05 

X<4 

-1.23 

0.97 

1,92 

P 

> 

.05 

x5 

-0.02 

0.00 

1,92 

P 

> 

.05 

x6  • 

-1.25 

1.31 

1,92 

P 

> 

.05 

x7 

-0.5  3 

0.18 

1,92 

P 

> 

.05 

These  data  provide  no  significant  evidence  of  any 
effect  on  outcome  due  to  sex.   Analysis  then  focused  on 
effects  of  sex  on  each  of  the  four  dependent  measures 
provided  on  the  A.F.A.  ballot. 
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When  considering  win-loss  as  the  dependent  measure, 
there  were  a  number  of  significant  differences  revealed 
by  the  data.   Sex  affected  win-loss  both  for  debaters 
and  for  judges.   These  results  are  shown  in  Table  VII. 


TABLE  VII 
Regression  By  Sex  On  Win-Loss  for  Groups 


Group 


F-score 


Male  debaters  1Q.37 

Female  debaters  10.25 

Male  judges  16.13 

Female  judges  5.22 


d.f.    Interpretation 


3,92 
4,92 

3,92 
6,92 


p  <  .01 
p  <  .01 

p  <  .01 
p  <  .01 


Interaction  between  the  sex  of  the  debater,  the  sex 
of  his  colleague ,  and  the  sex  of  the  j  udge  was  also 
considered.   In  all  cases,  the  calculated  regression 
weights  were  found  to  be  significant  with  p  <  *.05.   These 
results  are  shown  in  Table  VIII. 

When  considering  the  effects  of  sex  on  the  dependent 
measure  speaker  rating,  analysis  again  indicated  the 
presence  of  significant  regression.   No  significant 
effects  were  found  for  groups  in  speaker  rating.   However, 
this  was  not  the  case  when  considering  interaction  among 
the  sex  of  the  debater,  the  sex  of  his  colleague,  and  the 
sex  of  the  judge.   These  results  are  shown  in  Table  IX. 
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TABLE   VIII 
Regression  of  Sex  on  Win-Loss 


Variable 

(debater/colleague/judge) 


xl  male /male /female 

x2  male /female /male 

x3  male /female /female 

x4  female /male /male 

x5  female /male /female 

x6  female /female /male 

x7  female /female /female 


V/eight 


-0.54 
-0.88 
-0.44 

-0.88 
-0.44 

-0.83 
-0.50 


F- score 


18.47 

24.66 

6.91 

24.66 
6.91 

23.36 
8.05 
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TABLE   IX 


Regression  of  Sex  on  Speaker  Ratings 

Variable                     Weight   F-score   Interpretation 
(debater/colleague/ judge) \ 

xl     male /male /female       -0.30     0.08     p  >  .05 


x2  male /female /male  -3.30  H.94  p  <  .05 

x3  male /female /female  -1.47  1.06  p>.05 

x4  female /male /male  ,  -3.5  8  6.12  p  <  .05 

x5  female /male /female  -3.47  5.92  p  <  .05 

x6  female /female /male  0.28  0.05  p  >  .05 

x7  female /female /female  -2.30  2.40  p  >  .05 
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.  Sex  affected  team  ratings  both  for  debaters  and  by 
judges.  Regression  of  sex  on  team  ratings  by  groups  is 
summarized  in  Table  X, 


Group 


TABLE  X 
Regression  of  Sex  on  Team  Ratings  for  Groups 

F-score  d.f . Interpretation 


Male  debaters 
Female  debaters 


2.34  3,92       p  >  .05 

2.67  4,92       p  <  .05 


Male  judges  3.44  3,92       p  <  .05 

Female  judges  1-82  4,92       p  >  .05 


Once  again,  significant  interaction  was  found  to 
exist  among  the  sex  of  the  debater,  the  sex  of  the 
colleague,  and  the  sex  of  the  judge.   Ilie  regression 
weights  for  various  sex  combinations  and  the  results  of 
the  tests  for  significance  are  shown  in  Table  XI. 

When  considering  the  effects  of  sex  on  the  dependent 
measure  speaker  ranking,  analysis  again  indicated  the 
presence  of  significant  regression.   The  differences 
between  groups  were  tested,  and  the  results  are  shown 

in  Table  XII. 

As  was  the  case  for  "the  other  three  dependent 
measures,  speaker  ranking  revealed  interaction  effects 


TABLE  XI 
Regression  of  Sex  on  team  Ratings 
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Variable                     Weight   F-score   Interpretation 
(debater/colleague/judge)    \ 


xl  male /male /female 

x2  male /female /male 

x3  male /female /female 

x4  female /male /male 

x5  female/male/ female 

x6  female /female /female 

x7  female /female /female 


-0.09 


0.19 


p  >  .05 


-0.58 
-0.48 

4.53 
3.40 

P    < 

P    > 

.05 
.05 

-0.58 
-0.48 

4.53 
3.40 

P  < 
P    > 

.05 
.05 

0.13 
-0.45 

0.32 
2.77 

P  > 
P    > 

.05 
.05 

TABLE  XII 

Regression  of  Sex  on  Speaker  Rankings 
for  Groups 


Groups 


F-score 


Male  debaters 
Female  debaters 

Male  judges 
Female  judges 


4.09 
4.26 

6.60 
2.19 


d.f. 


3,92 
4,92 

3,92 
4,92 


Interpretation 


P  < 

P  < 


.05 
,01 


p  <  .01 
p  >  .05 
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between  the  sex  of  the  debaters  and  the  sex  of  the 
judge.   The  calculated  regression  weights,  with  their 
associated  F  statistics  and  interpretations ,  are  shown 
in  Table  XIII. 


TABLE  XIII 
Regression  of  Sex  on  Speaker  Rankings 

Variable                     Weight   F-score  Interpretation 

(debater/colleague/judge) 

xl   male /male /female           0.80     6.49  p  <  .05 

x2    male /female /male           1.43    10.42  p<.01 

x3   male /female /female         0.63     2.24  p  >  .05 

x4   female/male/male          1.4  3    10.42  p  <  -01 

x5    female/male /female         0.96     5.21  p  <  .05 

x6    female /female /male         1.30     11.38  p<  »01 

x7   female /female /female       0.80     3.28  p  >  .05 


prestige 

The  effect  of  the  rater  variable  "prestige"  on  the 
index  of  outcome  was  examined  with  Model  #3,  as  described 
in  Chapter  II.   The  predictors  for  this  analysis  were 
dichotomous  variables  representing  the  eighteen  possible 
combinations  of  high  prestige,  low  prestige,  and  junior 
college  debate  teams  and  judges.   The  appropriate, 
regression  weights  for  Model  #3,  as  calculated  by 
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BMDX6  3,  and  the  results  of  individual  tests  for  signifi- 
cant regression,  are  shown  in  .Table  XIV, 

Since  none  of  the  above  items  reached  a  probability 
level  of  .95  or  greater,  the  model  was  re-evaluated. 
On  the  basis  of  this  evidence,  the  expected  result  for 
teams  at  all  prestige  levels  would  be  that  team's 
average  index  of  outcome  in  past  debates. 

Confirmation  of  these  results  was  sought  by 
examining  the  regression  of  prestige  on  each  of  the 
four  dependent   measures  provided  by  the  A.F.A.  Form 
"C"  ballot. 

For  the  dependent  measure  win-loss,  significant 
regression  effects  were  found  for  both  low  prestige  and 
junior  college  debate  teams.   In  both  instances,  the 
calculated  F  statistic  was  found  to  be  significant  with 
p  <  .01.   When  interaction  between  the  prestige  of  the 
teams  in  a  given  round  and  the  prestige  of  the  judge 
were  considered,  a  number  of  significant  differences 
were  found  to  exist.   These  results  are  shown  in  Table  XV. 

When  considering  the  dependent  measure  speaker 
rating,  the  prestige  of  a  debate  team  was  found, to  have 
a  specific  effect  with  p  <  .05.   Significant  regression 
effects  were  found  for  debate  teams  at  all  prestige 
levels,  and  for  the  scores  given  by  judges  of  all  prestige 
levels.   In  each  instance,  the  calculated  F  statistic  was 
found  to  be  significant  with  p  <  ,05. 
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TABLE  XIV 
Regression  Weights  for  Rater  Variable  Prestige 

Variable  VJeight  F-score d.f . Interpretation 

xl  0.21     .015      1,102      p  >  .05 

x2  -1.05      .331      1,102      p  >  .05 

x3  1.54     .802      1,102      p  >  .05 

xU  1.10      .1+10      1,102      p  >  .05 

x5  0.32      .034       1,102       p  >  .05 


x6 

0.39 

.034 

1,102 

P 

> 

.05 

x7 

1.07 

.413 

1,102 

P 

> 

.05 

x8 

1.52 

.702 

1,102 

P 

> 

.05 

x9 

0.83 

.268 

1,102 

P 

> 

.05 

xlO 

0.93 

.316 

1,102 

P 

> 

.05 

xll 

-0.12 

.006 

1,102 

P 

> 

.05 

xl2 

0.11 

.004 

1,102 

P 

> 

.05 

xl3 

1.90 

1.221 

1,102 

P 

> 

.05 

xl4 

2.01 

1.371 

1,102 

P 

> 

.05 

xl5 

1.58 

.972 

1,102 

P 

> 

.05 

xl5 

1.07 

.418 

1,102 

P 

> 

.05 

xl7 

1.43 

.472 

1,102 

P 

> 

.05 
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TABLE  XV 

Regression  of  Prestige  on  Win-Loss 

Variable                 Weight  F-score  Interpretation 

(team/opponent/ judge) 

xl   high/low/low           -0.25  0.58  p  >  .05 

x2   high/low/jr.coll.      -0.50  1.93  p  >  .05 

x3   high/ jr. coll. /high      0.00  0.00  p  >  .05 

x4   high/jr. coll. /low      -0.33  0.96  p>  .05 

x5   high/jr. coll. /jr. coll.  -0.67  3.85  p  >  .05 

x6   low/hihg/high         -1.00  5.77  p  <  .05 

x7   low/high/low           -0.75  5.20  p<  .05 

x8   low/high/ jr. coll.      -0.50  1.93  p  >  .05 

x9   low/ jr. coll. /high      -0.17  0.28  p>  .05 


xlO  low/jr. coll. /low 


-0.50     2.31       P  >  .05 


xll  low/jr. coll. /jr. coll.  -1.00  9.24  P  <  .01 

xl2  jr.coll./high/high  -1.00  8.66  p  <  .01 

xl3  jr. coll. /high/low  -0.67  3.85  p  >  .05 

xl4  jr. coll. /high/jr. coll.  -0.33  0.96  p  >  .05 

xl5  jr. coll. /low/high  -0.83  6.87  p  <  .05 

xl6  jr. coll. /low/ low  -0.50  2.31  P  >  .05 

xl7  jr.coll. /low/jr. coll.  -0.00  0.00  p  >  .05 
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TABLE  XVI 

Regression  of  Prestige  on  Speaker  Rating 

Variable                Weight  F-score  Interpretation 

( team/  opponent  /  j  udge ) ^ 

xl  high/low/low          -9.00  8.84  p  <  .01 

x2   high/low/jr.coll.     -3.50  1.11  p  >  .05 

x3  high/jr. coll. /high    -3.17  1.03  p  >  .05 

x4  high/jr.coll./low     -5.67  3.28  p  >  .05 

x5   high/jr. coll. /jr.coll. -7. 17  5.25  p<.05 

x6   low/high /high         -4.50  1.38  p  >  .05 

x7   low/high/low         -10.50  12.03  p<.01 

x8   low/high/jr.coll.     -4.00     1.45  P  >  .05 

x9   low/ jr. coll. /high     -5.08     3.02  p  >  .05 

xlO  low/ jr. coll. /low      -8.63     8.11  P  <  .01 

xll  low/jr.coll./jr.coll.-10.50  12.03  P  <  .01 

xl2  jr. coll. /high/high    -5.50     3.09  P  >  .05 

xl3  jr. coll. /high/low     -6.83     4.78  P  <  .05 

xl4  jr.coll./high/jr.coll.-7.83     6.28  p  <  .05 

xl5  jr. coll. /low/high    -10.00  11.69  P  <  .01 

xl6  jr. coll. /low/low      -8.13     7.20  P  <  .01 

xl7  jr. coll. /low/jr. coll.  -9.13     9.08  P  <  .01 
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When  interaction  between  the  prestige  of  the  teams 
in  a  given  round  and  the  prestige  of  the  judge  was 
considered,  a  number  of  significant  differences  were 
found  to  exist.   These  results  are  shown  in  Table  XVI. 

When  considering  the  dependent  measure  team  rating, 
analysis  revealed  significant  differences  both  among 
expected  scores  by  teams  and  from  judges  at  the  different 
levels  of  prestige.   These  results  are  summarized  in 
Table  XVII. 


TABLE  XVII 

Regression  of  Prestige  on  Team  Ratings  for 
Groups 


Group  F-score  d.f.  Interpretation 

High  prestige  teams    2,88  5,102  p  <  .05 

Low  prestige  teams     3.81  6,102  p  <  .01 

Junior  college  teams    H.38  6,102  p  <  .01 

High  prestige  judges   5,10  5,102  p  <  ,01 

Low  prestige  judges    2,76  6,102  p  <  .05 

Junior  college  judges   4.25  6,102  p  <  .01 


Interaction  between  the  prestige  of  the  two  teams,  in 
the  debate  round  and  the  prestige  of  the  judge  was  also 
tested.   Again,  significant  differences  in  the  expected 
team  ratings  were  discovered.   The  calculated  regression 
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weight  for  each  combination,  along  with  its  F  statistic 
and  the  interpretation  of  that  score,  is  shown  in 
Table  XVIII. 

When  considering  the  dependent  measure  speaker  ranking, 
analysis  revealed  significant  difference  only  for  the 
group  of  low  prestige  debate  teams.   The  F  statistic 
calculated  for  this  group  by  BMDX6  3  was  foxmd  to  be 
significant  with  p  <  .05.   No  significant  differences 
were  found  for  teams  at  other  prestige  levels,  or  for  the 
scores  given  by  judges  at  any  prestige  level.   However, 
when  interaction  between  the  prestige  levels  of  the  debate 
teams  and  the  pr^estige  level  of  the  judge  was  considered, 
a  number  of  significant  differences  were  revealed.   The 
regr-ession  weights  for  each  combination,  along  with  the 
appropriate  F  statistic  and  its  interpretation,  are 
shown  in  Table  XIX. 

Proximity 

The  effect  of  the  rater  variable  "proximity"  on  the 
index  of  outcome  was  examined  with  Model  #4,  as  described 
in  Chapter  II.   The  predictors  for  this  analysis  were 
dichotomous  variables  representing  the  eight  possible 
combinations  of  teams  and  judges  when  considering  both 
geographical  proximity  and  N.D.T.  district  memberships. 
The  appropriate  regression  weights  for  Model  #4,  as 
calculated  by  EMDX6  3,  and  the  results  of  individual  tests 
for  significant  regression,  are  shown  in  Table  XX. 
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TABLE   XVIII 
Regression  of  Prestige  on  Team  Ratings 

Variable.                   Weight  F-score   Interpretation 
(team/opponents/ judge) ■    

xl   high/low/low            -1.75     7.99  p  <  .01 

x2   high/low/jr.coll.       -0.50     0.54  p  >  .05 

x3   high/jr. coll. /high      -1.00     2. 45  p  <  .05 

x4   high/jr. coll. /low       -1.33     4.35  p  <  .01 

x5   high/jr. coll. /jr. coll.   -1.67     6.80  p<.01 

x6    low/high/high           -I'OO     ^'^^  p>  .05 

x7    low/high/low            -2.00  10.45  p  <  .01 

x8    low/high/jr.coll.       -1.00     2.18  p  >  .05 

x9   low/jr. coll. /high       -1.33     4.97  p  <  .01 

xlO. low/ jr. coll. /low        -2.00  10.45  p  <  .01 

xll   low/jr. coll. /jr. coll.    -2.25  13.33  p  <  .01 

xl2   jr. coll. /high/high      -1.33     ^.^B  p  <  .01 

xl3   jr. coll. /high/low       -2.33  13.33  p  <  .01 

xl4   jr. coll. /high/jr. coll.   -2.00      9.79  p  <  .01 

xl5   jr. coll. /low/high       -2.50  17.49  p  <  .01 

xl6   jr. coll. /low/low        -1.75     7.99  p  <  .01 

xl7   jr. coll. /low/jr. coll.    -1.50     5.88  p  <  .01 
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TABLE  XIX 
Regression  of  Prestige  on  Speaker  Rankings 

Variable                   Weight   F-score   Interpretation 
( team/ opponent /judgei) ■ 


xl  high/ low/ low 

x2  high/low/jr.coll. 

x3  high/ jr. coll. /high 

c4  high/ jr. coll. /low 

x5  high/ jr. coll. /jr. coll, 

x5  low/high/high 

x7  low/high/low 

x8  low/high/ jr. coll. 


0.25  0.09  p  >  .05 

0.50  0.29  p  >  .05 

0.33  0.15  p  >  .05 

0.83  0.93  p  >  .05 

1.00  1.34  P  >  .05 

2.00  3.57  p  <  .01 

1.75  4.37  p  <  .01 

1.50  2.67  p  <  .05 


x9   low/ jr. coll. /high        0.25    0.09     p>.05 
xlO   low/jr. coll. /low 


xll  low/jr. coll. /jr. coll, 


0.87     1.09     p  >  .05 
1.38    2.70     p  <  .05 


xl2   jr. coll. /high/high       1.67     3.71     p<.01 

xl3  jr. coll. /high/ low 

xl4  jr. coll. /high/jr. coll. 


xl5  jr. coll. /low/high 
xl6   jr. coll. /low/low 


1.17  1.82  p  >  .05 

1.00  1.34  P  >  .05 

1.75  4.68  P  <  .01 

1.13  1.81  P  >  '05 


xl7  jr. coll. /low/jr. coll.     0.50     0.36     P  >  -05 
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TABLE  XX 
Regression  Weights  for  Rater  Variable  Proximity 


Variable 

Weight, 

F-score 

Interpretai 

xl 
x2 
x3 

2.87 
1.36 
0.32 

6.12 
1.24 
0.07 

p  <  .05 
p  >  .05 
p  >  .05 

x4 
x5 
x6 

1.83 
1.59 
1.77 

2.66 
2.00 
2.51 

p  >  .05 
p  >  .05 
p  >  .05 

x7 


1.20 


1.15 


p  >  .05 


Further  explanation  of  the  mixed  results  among  the 
above  tests  was  sought  by  considering  the  regression  of 
proximity  on  each  of  the  four  dependent  measures  provided 
by  the  A.F.A.  Form  "C"  ballot. 

For  the  dependent  measure  win-loss,  no  significant 
regression  effects  were  discovered,  either  for  geographical 
proximity  or  for  district  membership.   Examination  of 
interaction  effects  also  produced  no  significant 
differences.   In  all  cases,  p  >  .05. 

When  considering  the  dependent  measure  speaker 
rating,  the  proximity  of  a  debate  team  to  the  judge  was 
found  to  have  a  significant  effect  with  p  <  .05.   When 
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interaction  between  teams'  and  judges'  district  member- 
ships was  considered,  a  number  of  significant  differences 
were  found.   These  results  are  shown  in  Table  XXI. 
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TABLE  XXI 
Regression  Of  Proximity  On  Speaker  Ratings 

Variable Weight F-score  Interpretation 

xl  -0.92  0.42 

x2  -1.85  1.56 

x3  -4.65  11.13 

x4  -0.11  0.01 

x5  -1.5"+  1.27 

x6  -1.11  0.66 


p 

> 

.05 

p 

> 

.05 

p 

< 

.01 

p 

> 

.05 

p 

> 

.05 

p 

> 

.05 

x7 


-1.54  1.27  p  >  .05 


The  results .fore tests  of  the  effect  of  proximity  on 
the  dependent  measure  team  rating  were  similar  to  those 
for  speaker  rating,   liflien  interaction  effects  were  tested, 
a  number  of  significant  differences  were  discovered. 
The  calculated  regression  weight  for  each  combination, 
along  with  its  F  statistic  and  the  interpretation  of 
that  score,  is  shown  in  Table  XXII. 
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TABLE   XXII 
Regression  of  Proximity  on  Team  Rating 


Variable 

Weight 

F-score 

Interpretat: 

xl 

-0.33 

1.60 

p  >  .05 

x2 

-0.23 

0.71 

p  >  .05 

x3 

-0.63 

5.24 

p  <  .05 

x4 

-0.26 

1.06 

p  >  .05 

x5 

-O.Hl 

2.5H 

p  >  .05 

x6 

-0.41 

2.54 

p  >  .05 

x7  -0.26         1.06  P  >  .05 


Examination  of  the  dependent  measure  speaker  ranking 
revealed  no  significant  differences  between  means  for  the 
various  groups.   This  trend  continued  when  interaction 
was  considered.   In  all  cases,  the  calculated  regression 
weights  were  found  not  to  be  significant. 


CHAPTER   IV 
CONCLUSIONS 

The  results  of  this  study  indicate  that  the  format 
variables  "side  of  topic"  and  "speaker  position"  have 
no  significant  effect  on  the  overall  outcome  of  inter- 
collegiate debates  as  measured  by  the  dependent  variable 
index  of  outcome.   This  result  was  confirmed  when  the 
effect  of  format  variables  on  the  dependent  measure 
win-loss  was  considered.   The  thrust  of  this  evidence 
was  to  support  the  validity  of  the  format  currently  used 
in  intercollegiate  debate. 

Specifically,  hypotheses  1.1,  1.2,  1.3,  2.1,  2.2, 
and  2.4  were  not  rejected.   In  terms  of  overall  performance 
and  of  win- loss,  there  is  no  evidence  to  suggest  that  the 
format  of  intercollegiate  debating  is  biased  tov;ard 
either  side  or  any  speaker  position. 

Hypothesis  2.3  was  rejected,  with  «  =  .05.   The 
corrected  model  for  the  prediction  of  speaker  rank  on  the 
basis  of  speaker  position  was  as  follows. 

y  =  2.0  +  .75x  +  e, 

where  y  is  the  expected  rank  and  x  is  a  dichotomous 
variable  representing  the  second  affirmative  speaker. 
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The  rejection  of  hypothesis  2.3  leads  to  the  conclusion 
that  second  affirmative  speakers  tend  to  rank  lower  than 
speakers  in  any  other  position.   This  raises  doubts  as 
to  the  validity  of  using  speaker  rank  as  a  factor  in 
determining -the  outcome  of  intercollegiate  debates.   The 
use  of  the  measure  would  unfairly  handicap  second 
affirmative  speakers. 

The  rater  variable  "sex"  appears  to  have  a 
significant  effect  on  some  measures  of  the  outcome  of 
debates.   In  terms  of  the  overall  measure  index  of 
outcome,  there  was  no  observed  effect  due  to  sex.   This 
finding  disagrees  with  the  conclusions  of  Hayes  and 
McAdoo  and  Hens ley  and  Strother. 

However,  when  focusing  directly  on  the  dependent 
measure  win- loss,  the  conclusions  of  Hens ley  and  Strother 
were  confirmed.   Specifically,  hypotheses  3.1,  3.4,  and 
3.7  were  rejected  with  «  =  .05.   The  corrected  form  of 
Model  #2  for  prediction  of  win-loss  results  by  sex  was 
al  follows. 

y   -    1.00  -  .54Xj^  -  .88X2  -  .^i+Xg  -  .88x^  - 

.i+4x_  -  .83Xg  -  .50x7  +  e 

where  x.  represents  a  male  team  before  a  female  judge; 

x^,  X4  represent  mixed  teams  before  a  male  judge; 

X3,X5  represent  mixed  teams  before  a  female  judge; 

X  represents  a  female  team  before  a  male  judge;  and 
6 

x^  represents  a  female  team  before  a  female  judge. 
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If  y  >  .5  is  defined  as  an  expected  win,  and  y  <  ,5 
as  an  expected  loss,  for  any  given  debate,  then  these 
results  indicate  that  all-male  teams  had  a  greater 
expectation  of  winning  before  a  male  than  before  a  female 
judge.   Mixed  teams  and  all-female  teams,  however,  had  an 
expected  loss  from  male  judges  and  an  expected  win  from 
female  judges. 

When  considering  the  dependent  measure  speaker 
rating,  hypotheses  3.2  and  3.5  were  rejected.   The 
corrected  form  of  Model  #2  for  prediction  on  the  basis  of 
sex  was  as  follows. 

y    =  22.80  -  S.Sx^  -  3.68Xj^  -  3.47x^  +  e. 

This  model  indicates  that  the  members  of  mixed 
teams  received  lower  ratings  than  either  all-male  or 
all-female  teams.   Before  a  male  judge,  the  expected 
speaker  rating  for  the  male  member  of  a  mixed  team  was 
19.50,  as  compared  to  22.80  for  a  male  debater  with  a 
male  colleague  before  a  male  judge.   The  expected  rating 
for  the  female  member  of  a  mixed  team  before  a  male  judge 
was  19.12.   When  debating  before  a  female  judge,  the 
female  in  a  mixed  team  had  an  expected  rating  of  19.33. 

The  dependent  measure  team  rating  also  revealed 
significant  effects  due  to  sex.   In  general,  female 
debaters  tended  to  be  associated  with  lower  team  ratings 
than  did  male  debaters.   Conversely,  male  judges  tended 
to  give  lower  team  ratings  than  female  judges. 
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The  corrected  form  of  Model  #2    for  predicting  team 
rating  on  the  basis  of  sex  was  as  follows, 

y  =  3.70  -  .58x  +  e, 

when  X  represents  mixed  teams  before  a  male  judge. 
The  data  also  provided  sufficient  evidence  to 
support  the  rejection  of  hypotheses  3.3  and  3.6  with 
«   =  .05.   The  corrected  form  of  Model  j^2  for  predicting 
speaker  rank  by  sex  was  as  follows. 

y  =  1.70  +  .BOx^  +  1.U3X2  +  l«^3x.^.  + 

.96Xc  +  1.30x^  +  e 
o         6 

This  model  indicates  that  the  members  of  mixed 
teams  were  ranked  lower  by  male  than  by  female  judges. 
The  expected  rankings  were  3.13  as  compared  to  1,70. 
For  all-male  teams,  higher  rankings  came  from  male 
judges  (1.7  as  compared  to  2,5  for  female  judges).   For 
all-female  teams,  better  rankings  were  received  from 
female  than  from  male  judges. 

The  presence  of  significant  interactions  betv;een 
the  sex  of  the  debaters  and  the  sex  of  the  judge 
represents  a  serious  challenge  to  the  integrity  of 
intercollegiate  debate.   Immediate  z-'esearch  is  needed 
to  discover  means  of  compensating  for  the  biases  revealed 
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by  this  study.   Until  such  means  can  be  discovered,  debate 
judges  need  to  take  rigorous  steps  to  blot  out  whatever 
biases  are  affecting  their  decisions. 

It  should  be  noted  in  passing  that  this  researcher 
did  not  accept  the  results  of  this  study  as  indicative 
of  a  difference  in  performance  between  the  sexes.   Such 
a  difference,  however,  if  it  did  exist,  might  represent 
the  cause  of  the  significant  regression  effects  discussed 
here.  An  experimental  design  which  could  hold  ability 
constant  while  manipulating  sex  would  shed  more  light  on 
the  issue,  and  would  be  of  great  value  to  the  field  of 
forensics. 

The  study  also  provided  evidence  of  a  significant 
effect  due  to  the  rater  variable  "prestige."  While 
the  overall  measure,  index  of  outcome,  revealed  no 
significant  differences,  each  of  the  four  measures 
from  the  A.F.A.  Form  "C"  ballot  indicated  some  effect 
due  to  prestige.   Hypotheses  H.l,  4.2,  and  4.3  were  all 
rejected  with  «  =  ,05. 

VThen  considering  the  dependent  measure  win-loss,  the 
corrected  form  of  Model  §3  for  prediction  on  the  basis  of 
prestige  was  as  follows. 

y  =  1.00  -  l.OOXg  -  .75x^  -  l.OOx^^  -  l.OOx^g  ' 

.83x    e. 
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where  x.  represents  a  low  prestige  team  debating  a 
high  prestige  team  before  a  high  prestige 
judge; 

x^   represents  a  low  prestige  team  debating  a 
high  prestige  team  before  a  low  prestige 
judge; 

X, ,  represents  a  low  prestige  team  debating  a 
junior  college  before  a  junior  college 
judge; 

X, „  represents  a  junior  college  team  debating 
a  high  prestige  team  before  a  high  prestige 
judge;  and 

X,  (.  represents  a  junior  college  team  debating 
a  low  prestige  team  before  a  high  prestige 
judge. 

This  model  indicates  that,  except  for  the  five  cases 
mentioned  above,  there  is.no  effect  due  to  prestige  in 
the  win- loss  outcome  of  intercollegiate  debates.   However, 
low  prestige  teams  have  an  expected  loss  when  debating  high 
prestige  teams  before  anything  other  than  a  junior  college 
judge  (y  <  .5  is  defined  as  an  expected  loss). 

Low  prestige  teams  also  have  an  expected  loss  when 
debating  a  junior  college  team  before  a  junior  college 
judge.   Junior  college  teams  debating  before  a  high 
prestige  judge  had  an  expected  loss  (y  =  0.00,. 17)  when 
debating  anything  other  than  another  junior  college 
team. 

When  considering  the  dependent  measure  speaker  rating, 
the  corrected  form  of  Model  #3   for  prediction  on  the  basis 
of  prestige  was  as  follows . 
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y  =  28.00  -  9. Ox   -  7.17x   -  10. 5x^  -  8.63x^q  - 
10. 5x^^  -  6.33x3^3  -  7.83x^^  -  ^^•^'^IS  ~ 

8.13x,^  -  9.13x^^  +  e  , 
ib         17 

where   x   represents  a  high  prestige  team  debating  a 
■'■   low  prestige  team  before  a  low  prestige 
judge; 

X   represents  a  high  prestige  team  debating 
^   a  junior  college  before  a  junior  college 
judge; 

x,f,  represents  a  low  prestige  team  debating  a 
junior  college  before  a  lov7  prestige 
judge; 

x^ .,  represents  a  junior  college  team  debating 
a  high  prestige  team  before  a  low  prestige 
judge; 

X,  ^^  represents  a  junior  college  team  debating 

a  high  prestige  team  before  a  junior  college 
judge; 

X   represents  a  junior  college  team  debating 
^^   a  low  prestige  team  before  a  low  prestige 
judge;  and 

X  „  represents  a  junior  college  team  debating 
•^'  a  low  prestige  team  before  a  junior  college 
judge. 

This  model  indicates  that,  for  ten  of  eighteen 
possible  combinations,  prestige  had  an  effect  on  speaker 
rating.   High  prestige  teams  debating  low  prestige  teams 
before  high  prestige  and  junior  college  judges  had  an 
expected  speaker  rating  of  2  8.0,  as  compared  with  an 
expected  rating  of  19.0  against  a  similar  team  but  before 
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a  low  prestige  judge.   Against  junior  college  teams, 
high  prestige  debaters  had  an  expected  rating  of  2  8.0 
from  high  and  low  prestige  judges,  but  only  20.83  from 
junior  college  judges. 

Against  high  prestige  teams,  low  prestige  debaters 
had  an  expected  rating  of  2  8.0  from  high  prestige  and 
junior  college  judges,  but  only  17.5  from  low  prestige 
judges. 

The  rating  scores  of  junior  college  debaters  were 
the  most  affected  by  prestige.   Against  all  low  prestige 
teams,  junior  college  debaters  had  an  average  expected 
rating  of  18.92,  with  lowest  scores  coming  from  high 
prestige  judges  (y  =  18.00)  and  highest  scores  coming 
from  low  prestige  judges  (y  =  19.87).   Against  high 
prestige  teams,  junior  college  debaters  received  their 
highest  scores  from  high  prestige  judges  (y  =  2  8.0),  and 
their  lowest  scores  from  junior  college  judges  (y  = 
21.17). 

The  effect  of  prestige  pn  scores  awarded  by  judges 
was  most  heavily  concentrated  iln.  low  prestige  and  junior 
college  judges.   In  only  one  of  six- possible  combinations 
did  prestige  have  an  effect  on  the  scores  given  by  high 
prestige  judges  (x   >.   Low  prestige  judges  were  affected 
by  prestige  in  five  of  the  six  possible  combinations 
(omitting  only  Xj^ ) ,  and  junior  college  judges  showed  an 
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effect  in  four  of  the  six  prestige  combinations.  In 
general,  high  prestige  judges  awarded  higher  speaker 
ratings  than  judges  in  either  of  the  other  two  categories 

IVhen  considering  the  dependent  measure  team  rating, 
prestige  was  found  to  have  a  significant  effect  in  the 
rating  for  teams  at  all  prestige  levels  as  well  as  in 
the  scores  awarded  by  judges  at  all  prestige  levels. 
These  are  the  data  reflected  in  Table  XVII  in  Chapter  III. 
The  corrected  form  of  Model  #3  for  prediction  of  team 
ratings  on  the  basis  of  prestige  was  as  follows. 

y   =  5.0  -  1.75X  -  l.OOx  -  1.33X  -  l.eVXg  - 
1  3  4- 

2.00x^  -  1.33xg  -  2.00x^Q  -  2.26X3^3^  -  1.33x^2  " 
2.33x^3  -  2.00x^^  "  ^-SOx^^  -  l-^^^6  " 
l.SOx^^  +  e, 

where   x   represents  a  high  prestige  team  debating  a 
2   junior  college  before  a  high  prestige 
judge; 

X   represents  a  high  prestige  team  debating 
^      a  junior  college  before  a  low  prestige 
judge;  and 

X   represents  a  low  prestige  team  debating 
2   a  junior  college  before  a  high  prestige 
judge. 

This  model  indicates  that  in  fourteen  of  eighteen 
possible  combinations,  prestige  had  a  significant  effect 
on  team  rating.    High  prestige  teams  had  the  highest 
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expected  team  ratings,  with  lov;  prestige  teams  next. 
Junior  college  teams  had  the  lowest  expected  ratings. 
The  one  combination  where  this  did  not  hold  involved 
ranking  by  a  junior  college  judge. 

Irrespective  of  the  prestige  level  of  the  opposing 
team,  high  prestige  teams  received  their  highest  ratings 
from  high  prestige  judges,  as  did  low  prestige  teams. 
Junior  college  teams,  however,  received  their  highest 
ratings  from  junior  college  judges. 

Low  prestige  judges  appeared  to  have  reacted  most  to 
prestige.   The  ratings  awarded  by  these  judges  showed 
significant  regression  in  every  one  of  the  six  combinations 
studied.   High  prestige  judges  and  junior  college  judges 
were  affected  in  four  of  the  six  combinations.   High 
prestige  judges  awarded  the  highest  team  ratings  in  five 
of  the  six  combinations.   Junior  college  judges  awarded 
the  highest  team  ratings  in  three  of  the  six  combinations. 
(Two  combinations  received  the  same  ratings  from  high 
prestige  and  junior  college  judges.)   Low  prestige 
judges  tended  to  award  the  lowest  team  ratings. 

As  with  the  other  dependent  measures  from  the  A.F.A. 
ballot,  speaker  rankings  showed  an  effect  due  to  prestige. 
The  corrected  form  of  Model  #3  for  prediction  of  speaker 
rank  by  prestige  was  as  follows. 

Cg  +  1.75X7  +  ..o.g   •   --—11 


y  =  1.5  +  2. Ox-  +  1.75x7  +  1.5xg  +  1.38x   + 


1.67x^2  *   ^•'^^'^15  *  ^» 


75 


where   x   represents  a  low  prestige  debater  against  a 
2   high  prestige  team  before  a  junior  college 
judge. 


This  model  indicates  that  the  rankings  of  high 
prestige  debaters  were  unaffected  by  the  prestige  of 
the  opposing  team  or  the  prestige  of  the  judge.   For  all 
cases,  the  expected  ranking  of  a  high  prestige  debater 
was  1.50. 

The  rankings  of  low  prestige  debaters  were  affected 
by  prestige  in  four  of  the  six  combinations  studied. 
These  debaters  received  their  best  rankings  when  debating 
junior  college  teams.   The  rankings  of  junior  college 
debaters  were  affected  by  prestige  in  two  of  the  six 
combinations  studied.   Significant  differences  appeared 
only  when  these  debaters  were  speaking  before  a  high 
prestige  judge  and,  in  those  instances,  junior  college 
debaters  received  their  worst  rankings. 

It  must  be  noted  that  the  results  of  this  study 
cannot  be  taken  as  conclusive  evidence  that  judges  are 
affected  by  the  prestige  of  the  teams  they  hear  debate. 
Differences  in  outcome  may  be  caused  by  differences  in 
the  performance  by  the  debaters.   High  prestige  debaters 
may  be  better  than  low  prestige  or  junior  college 
debaters,  and  this  may  account  for  their  superior  results. 
However,  the  sample  for  this  analysis  was  drawn  only  from 
elimination  rounds  and  directly  power-matched  rounds. 
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In  these  cases ,  "the  performance  levels  of  the  two  teams 
are  theoretically  very  close.   The  presence  of  statistically 
significant  differences  is  therefore  disturbing.   Further 
research  is  called  for  to  increase  awareness  and  under- 
standing of  the  effects  of  prestige. 

Because  prestige  is  a  rater  variable,  it  may  not 
be  possible  to  eliminate  its  effects  by  manipulating 
tournament  procedures.   One  practice  which  might  have  a 
limited  success  would  be  the  elimination  of  school  names 
from  all  tournament  assignment  sheets,  and  the  use  of 
randomly  assigned  team  numbers  to  designate  participants 
during  a  tournament.   This  is  the  practice  at  some 
tournaments ,  and  it  would  be  effective  in  those  instances 
where  the  teams  az-^e  not  personally  known  by  the  judge 
in  a  round. 

The  results  of  this  study  indicate  that  the  rater 
variable  "proximity"  also  had  some  effect  on  the  measures 
of  the  outcome  of  intercollegiate  debates.   In  terms 
of  the  overall  measure,  index  of  outcome,  the  corrected 
form  of  Model  #U  for  prediction  on  the  basis  of  proximity 
was  as  follows. 

y  =  -l.ui  -»■  2.87X   +  e, 

where  x  represents  the  team  farthest  from  the  judge  when 
both  tea.iis  and  the  judge  are  from  the  same  N.D.T.  district. 
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This  model  indicated  that ,  within  any  given  N.D.T. 
district,  proximity  was  a  negative  influence.   Perhaps, 
in  this  case,  proximity  led  to  the  growth  of  rivalries 
rather  than  friendships.   ^Vhatever  the  explanation, 
these  results  are  counter  to  those  from  Brooks.   For 
the  dependent  measure  index  of  outcome,  hypotheses  5.1 
was  rejected  with  «  =  .05.   However,  there  was  insufficient 
evidence  in  the  results  to  reject  hypotheses  5.2,  5.3, 
or  5.4. 

When  considering  the  dependent  measure  win*loss, 
the  findings  v;ere  slightly  different.   No  regression 
effects  sufficient  to  reject  any  of  the  hypotheses  were 
found.   In  the  case  of  the  dependent  measure  speaker 
rating,  the  corrected  form  of  Model  #4  for  prediction  oh 
the  basis  of  proximity  was  as  follows. 

y  =  22.25  -  4.65x3  +  e, 

where  Xo  represents  the  team  farthest  from  the  judge 
when  that  judge  comes  from  an  N.D.T.  district  different 
from  that  of  the  two  teams  in  the  round. 

This  model  supports  the  rejection  of  hypothesis  5.2 
with  «  =  .05,  and  in  this  case  offers  support  for  the 
conclusions  presented  by  Brooks. 

The  dependent  measure  speaker  rank  produced  no 
significant  regression  effects.   However,  in  the  case  of 
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team  rating,  there  was  an  observable  effect.   The  corrected 
form  of  Model  #4  for  predicting  team  rating  on  the  basis 
of  proximity  was  as  follows. 

y  =  3.83  -  .6  3x   +  e. 

This  model  contradicts  the  results  of  testing  on  . 
speaker  rating,  and  tends  to  oppose  Brooks'  conclusions. 

The  findings  reported  here  offer  conflicting  evidence 
as  to  the  effect  of  geographical  proximity  on  the  outcome 
of  intercollegiate  debates.   In  view  of  Brooks'  more 
consistent  results,  additional  research  is  needed  to 
further  explain  the  effects  of  this  rater  variable. 

This  study  does  provide  consistent  evidence  that 
there  is  no  effect  due  to  N.D.T.  district  loyalties  or' 
animosities.   In  view  of  this,  perhaps  the  National 
Debate  Tournament  is  taking  an  unnecessary  precaution 
by  forbidding  judges  to  hear  a  team  from  their  own 
district  against  a  team  from  another  district. 

For  those  effects  of  proximity  which  do  exist, 
there  seem  to  be  two  alternatives  open  to  tournament 
directors.   The  first  would  involve  balancing  the 
geographical  distance  of  the  teams  from  the  judge.   The 
second  procedure  has  already  been  discussed,  i.e.  using 
only  team  identification  numbers  in  the  course  of  a 
tournament. 
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It  should  be  emphatically  noted  that  the  research 
reported  here  is  only  a  beginning.   Much  more  attention, 
especially  experimentation,  is  needed  in  order  to  validate 
these  results  and  to  determine  what  other  non-ability 
variables  5  if  any,  influence  the  outcome  of  intercollegiate 
debates.   While  essentially  descriptive  studies  such  as 
this  one  can  be  of  great  value  in  identifying  problem 
areas,  the  capacity  for  testing  solutions  to  those 
problems  is  uniquely  a  function  of  experimental  research. 

While  the  judges'  perception  of  the  sex  of  the 
debaters  would  be  difficult  to  manipulate  experimentally, 
both  prestige  and  proximity  would  be ,  at  least  partially, 
susceptible  of  such  manipulation.   Experimentation 
which  holds  performance  constant  while  varying  the 
description  of  the  source  has  a  long  history  and 
extensive  literature.   Techniques  should  be  available 
which  would  allow  the  manipulation  of  prestige  and 
proximity  as  independent  variables. 

The  competitive  nature  of  forensics  has ,  to  this 
point  in  time,  served  to  block  any  experimental  treatments 
in  situ.   Tliis  must  be  overcome.   The  use  of  transcripts, 
video-taped  debates,  sophomore  speech  classes,  etc., 
is  of  some  value.   But  such  vehicles  are  a  poor  substitute 
for  the  actual  tournament  situation. 

Further  study  is  needed  of  the  component  elements 
of  the  rater  variable  prestige.   This  study  operationalized 
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that  variable  as, a  function  of  N.D.T.  participation. 
Such  a  definition  may  be  either  inaccurate  or  incomplete. 
Only  additional  research  can  provide  a  comprehensive 
definition,  and  only  experimental  research  can  confirm 
the  effects  on  the  outcome  of  intercollegiate  debates. 
At  present, debate  tournaments  rely  primarily  upon 
win-loss  records  and  speaker  ratings  as  measures  of 
performance.   The  evidence  of  this  research  suggests 
that  these  measures  are  biased  by  the  rater  variables 
studied.   Some  new  dependent  measure  is  needed.   The 
principal  component  analysis  utilized  here  represented 
an  attempt  to  develop  such  a  measure.   The  results 
suggested,  however,  the  possibility  that  factor  scores 
tended  to  mask  differences  which  did  exist.   Research 
is  needed,  first,  to  determine  the  proper  bases  for 
evaluation  in  debate  and,  second,  to  devise  some 
overall  measure  which  will  serve  as  a  valid  and  reliable 
index  of  outcome. 

A  number  of  statistical  techniques  are  available 
which  might  aid  in  the  search  for  such  a  measure.   Factor 
analysis  should  not  be  abandoned  on  the  basis  of  this 
study.   It  merits  further  and  intensive  examination. 
Various  correlation  techniques,  if  studied,  might  lead 
to  the  development  of  some  index  eiipressed  in  terms  of 
its  relationship  to  current  measures. 
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This  study  presents  evidence  of  the  validity  of 
the  debate  format  currently  in  use.   The  evidences  of 
rater  bias  which  did  emerge  are  most  likely  beyond  the 
control  of  the  average  tournament  director.   There  are, 
however,  some  possible  solutions  suggested  by  the 
results  of  this  study. 

Given  access  to  high-speed  computing  facilities, 
individual  and  team  scores  on  the  present  ballot 
could  be  adjusted  on  the  basis  of  the  statistical  models 
developed  here.   This  would  compensate  for  the  effects 
of  rater  bias.   In  effect,  it  would  involve  the  creation 
of  a  handicapping  procedure  for  debate.   The  number  of 
possible  models,  and  their  complexity,  would  demand 
automatic  computation  to  avoid  unreasonable  delays. 
While  this  solution  seems  logically  based,  it  would 
probably  be  received  with  great  resistance  and  hostility 
on  the  debate  circuit.   Coaches  and  debaters  would 
probably  resent,  and  be  suspicious  of,  any  "tinkering"  with 
their  scores.   Acceptance  of  such  a  solution  would  depend 
upon  absolute  proof  that  rater  bias  was  affecting  out- 
come.  Such  proof  can  only  come  through  experimental 
research.   Thus  the  need  for  more  experimentation! 

Given  sufficient  funds,  "outside"  judges  could  be 
hired.   It  would  be  possible  to  bring  in  debate  coaches 
from  different  areas  of  the  country  who  do  not  have 
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teams  participating  in  the  tournament.   This  has  been 
suggested  for  the  various  N.D.T.  district  tournaments, 
but  no  substantive  action  has  been  taken.   Expense 
remains  a  major  barrier.   Outside,  non-debate,  judges 
are  an  additional  possibility,   Local  business  and 
professional  persons,  faculty  from  the  host  school, 
etc.,  could  be  recruited  to  serve  as  judges.   It 
cannot  be  assumed,  however,  that  rater  bias  would  not 
affect  these  judges  just  as  it  appears  to  affect 
coaches  who  now  serve  as  judges.   Research  would  be 
needed  to  determine  whether  or  not  rater  bias  would 
operate  among  such  a  group. 

The  ultimate  solution  to  the  problems  of  rater 
bias  must  be  found  in  the  judges  who  render  the  decisions. 
Better  training  for  debate  coaches,  emphasizing  the 
proper  basis  for  awarding  decisions,  would  be  of  value, 
Tliis  is  an  area  in  which  the  proposed  Developmental 
Conference  on  Forensics  should  act.   Such  a  conference 
could  and  should  propose  specific  standards  of  competence 
for  debate  coaches.   It  could  also  initiate  and  support 
financially  the  research  needed  to  determine  and 
empirically  verify  a  set  of  criteria  for  evaluating 
debate . 

Some  corrective  influence  would  probably  be 
generated  by  publicity  of  the  results  of  this  and 
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similar  studies.   Rater  bias  will  not  go  away  simply 
because  it  is  ignored.   The  opening  assembly  of  every 
debate  tournament  should  include  a  discussion  of  rater 
variables.   Biases  should  be  identified  specifically, 
and  judges  should  be  encouraged  to  be  sensitive  to  the 
influence  of  such  biases. 

A  stronger,  more  precise  statement  of  professional 
ethics  might  be  useful  in  conjunction  with  this  practice, 
Some  procedure  for  investigating  and  eliminating 
consistently  biased  judges  might  have  beneficial 
results.   All  of  these  are  areas  in  which  the  American 
Forensics  Association  can  and  should  exercise  its 
leadership. 
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